ABSTRACT

We present the Canada-France-Hawaii Telescope Lensing Survey (CFHTLenS) that accurately
determines a weak gravitational lensing signal from the full 154 square degrees of deep
multi-colour data obtained by the CFHT Legacy Survey. Weak gravitational lensing by
large-scale structure is widely recognised as one of the most powerful but technically
challenging probes of cosmology. We outline the CFHTLenS analysis pipeline, describing how
and why every step of the chain from the raw pixel data to the lensing shear and photometric
redshift measurement has been revised and improved compared to previous analyses of a subset
of the same data. We present a novel method to identify data which contributes a
non-negligible contamination to our sample and quantify the required level of calibration
for the survey. Through a series of cosmology-insensitive tests we demonstrate the
robustness of the resulting cosmic shear signal, presenting a science-ready shear and
photometric redshift catalogue for future exploitation.
1 INTRODUCTION

* heymans@roe.ac.uk

Our understanding of the Universe has grown rapidly over the past … independent and
diverse observations all point to a Universe dominated by dark energy and dark matter.
The concordant cosmology derived from these observations accurately determines the
composition of the Universe (see review article Weinberg et al. 2012 and references
therein) and the highest priority is now to understand the different phenomena that
comprise what is often referred to as the Dark Universe.

Weak gravitational lensing is a unique tool for cosmology which directly probes the mass
distribution of matter in the Universe independent of its state or nature. As light
propagates through the Universe its path becomes deflected by the gravitational potential
of the large-scale structures of matter, with the consequence that distant galaxy images
are observed to be weakly and coherently distorted. This observation can be directly
related to the underlying matter power spectrum of the Universe (see for example
Schrabback et al. 2010) and can pinpoint where matter is and how much of it there is
(Massey et al. 2007b; Heymans et al. 2008). Compared to other probes of cosmology, weak
lensing is particularly interesting as it provides a direct measurement of the growth of
large-scale structures in the Universe, allowing us to test the fundamental and
alternative gravity theories suggested to explain the dark energy in the Universe
(Reyes et al. 2010; Simpson et al. 2012).

Weak lensing by large-scale structure is widely recognised as one of the most powerful but
technically challenging probes of cosmology. Since its first detection (Bacon et al. 2000;
Van Waerbeke et al. 2000; Wittman et al. 2000), advances in technology and deep-wide
survey observations have been paralleled by significant community efforts in algorithm
development in order to overcome the … (Massey et al. 2007a; Bridle et al. 2010;
Hildebrandt et al. 2010; Kitching et al. 2012a,b). The measurement requires the detection
of percent level shear distortions imprinted on the images of distant galaxies by
cosmological lensing in the presence of temporal and spatially varying ~ 10% distortions
caused by the atmosphere, telescope and detector. The growth in precision of lensing
surveys over the past decade has required an increasing accuracy in the understanding of
the origins of the distortions and the impact of data reduction methods on our shear
measurement algorithms. These local effects are encompassed in the point spread function
(PSF) as measured from images of stellar objects in the survey.

In this paper we present the Canada-France-Hawaii Telescope Lensing Survey (CFHTLenS) that
accurately measures weak gravitational lensing from the deep multi-colour data obtained as
part of the CFHT Legacy Survey (CFHTLS). This survey spans 154 square degrees in five
optical bands u*g'r'i'z' with a 5σ point source limiting magnitude in the i'-band of
~ 25.5. CFHTLenS incorporates data from the main Wide survey, Deep survey, the astrometric
pre-imaging and photometric calibration post-imaging components of the CFHTLS which
completed observations in early 2009. The first weak lensing analysis of the CFHTLS-Wide
(Hoekstra et al. 2006) analysed 31 square degrees of single band i' data, showing the high
quality imaging obtained by the then newly commissioned one square degree field of view
MegaCam imager on the 3.6m CFHT. This conservative analysis selected regions of the data
far from the boundaries of the individual CCD chips in the MegaCam imager, omitting
one-third of the data. This strategy was used in order to circumvent issues associated
with the stacked combination of varied PSFs from different chips in the 7 dithered
exposures in these regions. This first analysis was followed by Fu et al. (2008) where the
57 square degrees of the survey analysed had, for the first time, sufficient statistical
accuracy to reveal hints of systematic errors in the measured shear signal on large
scales. Significant variations of the shear signal between individual MegaCam pointings
were then uncovered (Kilbinger et al. 2009). This hinted at a problem with understanding
the PSF even though a similarly conservative masking scheme had been applied to reject the
problematic image regions excised in the Hoekstra et al. (2006) analysis. In addition a
problem with the redshift-scaling of the two-point shear correlation function soon became
apparent when the data used in Fu et al. (2008) were combined with photometric redshift
measurements from Coupon et al. (2009) (see Kilbinger et al. 2009 for more details). The
CFHTLenS collaboration formed to develop new techniques and find a solution to reduce
these systematic errors.

In this paper we start in Section 2 by outlining the CFHTLenS analysis pipeline and the
importance of working with individual exposures rather than stacked images. Parts of the
pipeline are presented in much finer detail in Erben et al. (2012, pixel analysis),
Hildebrandt et al. (2012, photometry and redshift analysis), and Miller et al. (2012,
shear analysis). We then detail the methodology behind our cosmology-insensitive and
quantitative systematic error analysis in Section 3 and present the results of that
analysis, calibrating and selecting a clean data sample, in Section 4. We investigate the
impact of the calibration and data selection on the two-point shear correlation function
in Section 5, comparing our systematic error analysis to alternative tests advocated by
previous weak lensing analyses. We additionally show the robustness of the measurements by
performing a final demonstration that the shear signal is not subject to
redshift-dependent biases in Section 6, using a cosmology-insensitive galaxy-galaxy
lensing test. Finally we conclude in Section 7. Throughout this paper the presented errors
show the 1σ confidence regions as computed through bootstrap analyses of the data, unless
stated otherwise.



2 THE CFHTLENS ANALYSIS PIPELINE 

The CFHTLenS collaboration set out to robustly test every stage 
of weak lensing data analysis from the raw pixel data through data 
reduction to object detection, selection, photometry, shear and red- 
shift estimates and finally systematic error analysis. For each step 
in the analysis chain, multiple methods were tested and rejected 
or improved such that the final CFHTLenS pipeline used in this 
analysis re-wrote every single stage of the data analysis used in 
earlier CFHTLS analyses, and indeed the majority of all previous 
weak lensing analyses. We argue that there was not a single distinct 
reason why earlier CFHTLS lensing analyses suffered from sys- 
tematic errors by discussing a series of potential successive sources 
of systematic error that could have accumulated throughout earlier 
analyses. We detail each step in the analysis chain in this section 
for users and developers of weak lensing analysis pipelines. For 
those readers interested in the main changes a summary is provided 
in Table 1 which compares the key stages in the lensing analysis
pipelines used by CFHTLenS and in an earlier CFHTLS analysis
by Fu et al. (2008). Note that the Fu et al. (2008) pipeline is a good
example of the standard methods that have been used by the majority
of weak lensing analyses to date (c. 2012). One notable exception
is Bernstein & Jarvis (2002) who advocate the weak lensing
analysis of individual exposure data rather than stacks, by averag- 
ing the shear measurements from multiple exposures on a catalogue 
level. The extension of this exposure-level analysis argument to the 
more optimal, simultaneous, joint model-fitting analysis of multi- 
ple exposures, has been one of the most important revisions in our 
analysis. 
Table 1. Comparison of the key stages in the lensing analysis pipelines used by CFHTLenS and in an earlier CFHTLS analysis by Fu et al. (2008). Note that
the earlier CFHTLS pipeline is a good example of the standard methods that have been used by the majority of weak lensing analyses to date (c. 2012).

Pipeline Stage | Fu et al. (2008) CFHTLS Pipeline | CFHTLenS Pipeline
Astrometry reference catalogue | USNO-B1 | 2MASS & SDSS DR7
Cosmic Ray Rejection method | Median pixel count in stack | SEXTRACTOR neural-network detection refined to prevent the misclassification of stellar cores
Photometry measured on | Re-pixelised median stacks | Re-pixelised mean stacks with a gaussianised PSF
Star-galaxy size-mag selection | Averaged over each field of view (Pipeline I); Chip-by-chip selection (Pipeline II) | Chip-by-chip selection plus additional colour selection
Redshift distribution | Extrapolation from CFHTLS-Deep fields | From BPZ photometric redshift measurements of each galaxy
Shape Measurement method | KSB+ | lensfit
Shapes measured on | Re-pixelised median stacks | Individual exposures with no re-pixelisation
Systematic error analysis | Averaged results for full survey | Exposure level tests on individual fields


2.1 CFHTLS Data 

The CFHTLS-Wide data spans four distinct contiguous fields: W1
(~ 63.8 square degrees), W2 (~ 22.6 square degrees), W3 (~ 44.2
square degrees) and W4 (~ 23.3 square degrees). The survey strategy
was optimised for the study of weak gravitational lensing by reserving
the observing periods with seeing better than ~ 0.8 arcsec
for the primary lensing i'-band imaging. The other u*g'r'z' bands
were imaged in the poorer seeing conditions. A detailed report of
the full CFHTLS Deep and Wide surveys can be found in the TERAPIX
CFHTLS T0006 release document. All CFHT MegaCam
images are initially processed using the ELIXIR software at the
Canadian Astronomical Data Centre (Magnier & Cuillandre 2004).
We use the ELIXIR instrument calibrations and de-trended archived
data as a starting point for the CFHTLenS project.



2.2 Data Reduction with THELI 

The CFHTLenS data analysis pipeline starts with the public 
THELI data reduction pipeline designed to produce lensing quality 
data (Erben et al. 2005) and first applied to the CFHTLS in Erben
et al. (2009). We produce co-added weighted mean stacks for object
detection and photometry and use single exposure i'-band images 
for the lensing analysis. Relevant pixel quality information in the 
form of weights is produced for each image, both for the stacks and 
the individual exposures. 

Improvement of the cosmic ray rejection algorithm was one of the key developments in THELI
for CFHTLenS. We tested the robustness of the neural-network procedure implemented in the
SEXTRACTOR software package (Bertin & Arnouts 1996) to identify cosmic ray hits using the
default MegaCam cosmic ray filter created by the EyE (Enhance your Extraction) software
package (Bertin 2001). We found this set-up to reliably identify cosmic ray hits but at
the expense of the misclassification of the bright cores of stars in images with a seeing
better than ~ 0.7 arcsec. In images with a seeing of < 0.6 arcsec, nearly all stars
brighter than i'_AB ~ 19 had cores misclassified as a cosmic ray defect. This
misclassification is particularly problematic for lensing analyses that analyse individual
exposures as a non-negligible fraction of stars used for modeling the PSF are then
rejected by the cosmic ray mask. In addition this rejection may not be random, for example
the neural-network procedure may preferentially reject pixels in the cores of the stars
with the highest Strehl ratio thus artificially reducing the average Strehl ratio of the
PSF model in that region. For lensing analyses that use co-added stacks, this cosmic ray
misclassification is also an issue when using mean co-added images of the dithered
exposures. The misclassified exposure level pixels are, by definition, the brightest at
that location within the stack. When the cosmic ray mask removes these misclassified
pixels from the co-addition, the centres of stars therefore artificially lack flux which
produces an error in the PSF model.

In earlier analyses of CFHTLS, concerns about cosmic ray 
rejection were circumvented by using a median co-added image 
of the dithered exposures, even though this method does not max- 
imise the signal-to-noise of the final co-added image and can only 
be applied to images with enough exposures. In this case whilst 
the misclassification of cosmic rays is likely no longer an issue, a 
more subtle PSF effect is at play. As the median is a non-linear 
statistic of the individual pixel values, it destroys the convolutional 
relationship between the stars and galaxies in the stacked images 
such that the PSF from the stars differs from the PSF experienced 
by the galaxies. This can be illustrated by considering the case of 
four dithered exposures, three with similar PSF sizes and elliptic- 
ities and one exposure with a PSF with fainter extended wings. In 
the median co-added image, the PSF modeled from the stellar ob- 
jects will match the PSF in the first three exposures, as the fourth 
exposure is rejected at the location of the stars by the median opera- 
tion on the image. The PSF as seen by the fainter extended galaxies 
will however include the effects of the extended wing PSF from 
the fourth exposure. This is because pixel noise and the broader 
smoothing from the extended fourth PSF exposure can combine in 
such a way that the fourth exposure determines the final pixel count 
at some locations across the median co-added galaxy image. In the 
CFHTLS there are typically 7 exposures per image with significant 
variation of the PSF between exposures such that the previous use 
of median co-added images could well have contributed to the sys- 
tematic error found in earlier CFHTLS analyses. 
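
The non-linearity of the median can be illustrated with a small numerical experiment. The sketch below is not part of the CFHTLenS pipeline: it is a simplified, noise-free, one-dimensional toy with three exposures and deliberately distinct Gaussian PSF widths (all widths, the grid and the function names are illustrative assumptions), rather than the four-exposure case with noise described above. It compares the stacked galaxy profile with the profile predicted by convolving the galaxy with the PSF measured from the stacked star.

```python
import numpy as np

def gaussian(x, sigma):
    """Unit-sum 1-D Gaussian profile sampled on the grid x."""
    g = np.exp(-0.5 * (x / sigma) ** 2)
    return g / g.sum()

def observe(source, psf):
    """Convolve a source profile with a PSF on the same grid."""
    return np.convolve(source, psf, mode="same")

x = np.arange(-50, 51, dtype=float)
psfs = [gaussian(x, s) for s in (1.5, 2.5, 4.0)]  # hypothetical PSF widths
galaxy = gaussian(x, 3.0)                         # hypothetical galaxy profile
star = np.zeros_like(x)
star[50] = 1.0                                    # point source at x = 0

star_exposures = np.array([observe(star, p) for p in psfs])
gal_exposures = np.array([observe(galaxy, p) for p in psfs])

def stack_residual(combine):
    """Largest fractional mismatch between the stacked galaxy and the
    galaxy convolved with the PSF inferred from the stacked star."""
    psf_from_stars = combine(star_exposures, axis=0)
    stacked_galaxy = combine(gal_exposures, axis=0)
    predicted = observe(galaxy, psf_from_stars)
    return np.max(np.abs(stacked_galaxy - predicted)) / stacked_galaxy.max()

mean_residual = stack_residual(np.mean)      # linear: consistent to rounding error
median_residual = stack_residual(np.median)  # non-linear: percent-level mismatch
print(mean_residual, median_residual)
```

In this toy the mean stack preserves the convolutional relationship to floating-point precision, while the median stack does not, even without pixel noise. The size of the mismatch depends entirely on the assumed PSF widths and should not be read as an estimate for the CFHTLS data.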

For the current work we refined the standard procedures for cosmic ray flagging by
identifying cosmic rays in a two-stage process. First we identify cosmic rays using the
neural-network procedure in the SEXTRACTOR software package, as described above. We then
extract a catalogue of bright sources from the data using SEXTRACTOR with a high detection
threshold set, requiring more than 10 pixels to be connected with counts above 10σ.
Candidate unsaturated stars on the image are then selected using the automated THELI
routine which locates the stellar locus in the size-magnitude frame. We then perform a
standard PSF analysis to clean the bright candidate star catalogue using the 'Kaiser,
Squires and Broadhurst' method (Kaiser et al. 1995, hereafter KSB+). This consists of
measuring second order brightness moments and performing a two-dimensional second order
polynomial iterative fit to the PSF anisotropy with outliers removed to obtain a clean
bright star catalogue. As this particular stellar sample is bright, this method is very
effective at selecting stars. All bright stellar sources that remain in the sample in the
final fit are then freed from any cosmic ray masking in the first step. This procedure was
found to produce a clean and complete cosmic ray mask that left untouched the bright end
of the stellar branch required for the more thorough CFHTLenS star-galaxy classification
and subsequent PSF analyses (see Section 2.3). The method will, however, allow through the
very rare cases of real cosmic ray defects at the location of stars. In these cases, the
stellar objects will likely be flagged as unusual in the colour-colour stellar selection
stage that follows, as described in Section 2.3. Further properties of this method are
detailed in Erben et al. (2012).
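
The second stage of this process, in which confirmed bright stars are freed from cosmic ray masking, can be sketched as follows. This is a minimal illustration under stated assumptions, not the THELI implementation: the function name `unmask_star_cores`, the circular unmasking region and its `radius` are hypothetical choices, and the star positions are assumed to come from an already cleaned bright star catalogue.

```python
import numpy as np

def unmask_star_cores(cr_mask, star_xy, radius=3):
    """Return a copy of a boolean cosmic ray mask with all flags cleared
    within `radius` pixels of each confirmed star position (x, y).
    The radius is an illustrative assumption."""
    cleaned = cr_mask.copy()
    ny, nx = cleaned.shape
    yy, xx = np.mgrid[0:ny, 0:nx]
    for x0, y0 in star_xy:
        near_star = (xx - x0) ** 2 + (yy - y0) ** 2 <= radius ** 2
        cleaned[near_star] = False
    return cleaned

# Toy example: one misclassified stellar core and one genuine cosmic ray.
mask = np.zeros((20, 20), dtype=bool)
mask[10, 10] = True   # core of a bright star flagged by the first stage
mask[3, 15] = True    # cosmic ray hit far from any star
cleaned = unmask_star_cores(mask, star_xy=[(10, 10)])
```

As noted in the text, a scheme of this kind also clears a real cosmic ray that happens to land on a star, which is why a later check on the stellar sample is still needed.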

In addition to advances in cosmic ray identification, THELI was updated to perform
photometric and astrometric calibrations over each survey patch (W1, W2, W3 and W4) aided
by the sparse astrometric pre-imaging data and photometric calibration post-imaging data
(programs 08AL99 and 08BL99), in contrast to earlier calibration analyses on a square
degree MegaCam pointing basis. The resulting field-to-field RMS uncertainty in relative
photometry is σ ~ 0.01-0.03 magnitude in all passbands. The uniform internal astrometry
has a field-to-field RMS error σ ~ 0.02". Significant effort was invested in testing the
impact of using different reference catalogues (2MASS with SDSS-DR7 was found to be the
most accurate) in addition to the robustness of the SCAMP astrometric software (Bertin
2006) and SWARP re-pixelisation software (Bertin et al. 2002), and the impact of the
astrometric distortion correction, interpolation and re-pixelisation on the PSF and galaxy
shapes using simulated data. The conclusion of this work was that whilst these methods
were excellent for astrometry they were not sufficiently accurate for shape measurement,
particularly if the imaging was undersampled. For this reason we do not interpolate or
re-pixelate the data used for our lensing analysis. Instead we apply the derived
astrometric distortion model to the galaxy models when model fitting to the data (Miller
et al. 2012). Finally, the automated THELI masking routine was applied to the data to
identify saturated stars, satellite trails, ghosts and other artifacts. These masks were
individually inspected, verified and improved manually, and the importance of this manual
inspection is investigated in Section 5.3. A more detailed description of the THELI
analysis in CFHTLenS is presented in Erben et al. (2012).

2.3 Object Selection, PSF characterisation and Photometric 
Redshifts with Gaussianised Photometry and BPZ 

The next stage in the CFHTLenS data analysis pipeline is object 
detection and classification. We use the SEXTRACTOR software
(Bertin & Arnouts 1996) to detect sources in the i'-band stacks.
This initial catalogue forms the basis for the shape and photometric 
measurements that follow. Using the i'-band as the primary detec- 
tion image preserves the depth of the co-added data for our lensing 
population and it is this complete initial catalogue that is carried 
through the full pipeline with flags and weights added to indicate 
the quality of the derived photometric and shape parameters. 

We manually select stars in the size-magnitude plane. In this plane the stellar locus at
bright magnitudes and fixed constant size, governed by the PSF, can be readily
distinguished from the galaxy population. Stars are selected down to 0.2 magnitudes
brighter than the magnitude limit where the stellar locus and galaxy population merge. We
found the manual selection was necessary to select a sufficient number of reliable stars
at magnitudes fainter than the bright stellar sample that was automatically selected by
the THELI routine and used to improve the cosmic ray rejection algorithm (see Section
2.2). The manual selection is performed on a chip-by-chip basis as the stellar size varies
significantly across the field of view. If the star-galaxy separation is made over the
whole field this can result in truncation in the stellar density at the extremes of the
stellar locus in the size-magnitude plane. This would then result in a lack of stars in
the corresponding regions in the field of view and hence a poor PSF model in those
regions. This type of full field selection was part of the main Pipeline I CFHTLS analysis
of Fu et al. (2008), with a manual chip-based selection reserved only for those fields
which exhibited an unusually wide stellar locus scatter. The upcoming launch of the ESA
Gaia mission to survey over a billion stars in our galaxy will mean that this manual
stellar selection stage will not be required in the future.

For lensing studies we require a pure and representative star 
catalogue across the field of view that does not select against a par- 
ticular PSF size or shape. Provided the catalogue is representative it 
does not need to be complete. To ensure the purity of the sample se- 
lected in the size-magnitude plane we perform an additional colour 
analysis. We first construct a 4D colour space: g' - r', r' - i', i' - z'
and g' - z', and determine the distribution of stellar candidates in
this space. A stellar candidate is confirmed to be a star when more 
than five percent of the total number of candidates lie within a dis- 
tance of ~ 0.5 magnitudes from its location in the 4D space. We 
reject any object that lies below this threshold density, typically re- 
jecting about ten percent of the original, manually selected, stellar 
candidate list. 
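
The density criterion in the 4D colour space can be written compactly. The following is a minimal sketch under stated assumptions rather than the CFHTLenS code: the text does not specify the distance metric or whether a candidate counts itself, so a Euclidean distance and a neighbour count that excludes the candidate are assumed here, and the function name `confirm_stars` and the toy colours are hypothetical.

```python
import numpy as np

def confirm_stars(colours, radius=0.5, min_fraction=0.05):
    """colours: (N, 4) array holding g'-r', r'-i', i'-z' and g'-z'.
    A candidate is confirmed when more than `min_fraction` of all
    candidates lie within `radius` magnitudes of it (Euclidean distance
    assumed; the candidate itself is not counted)."""
    colours = np.asarray(colours, dtype=float)
    diff = colours[:, None, :] - colours[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    n_neighbours = (dist < radius).sum(axis=1) - 1
    return n_neighbours > min_fraction * len(colours)

# Toy candidate list: 40 objects on a tight locus plus two colour outliers.
rng = np.random.default_rng(0)
locus3 = rng.normal([0.5, 0.2, 0.1], 0.02, size=(40, 3))
locus = np.hstack([locus3, locus3.sum(axis=1, keepdims=True)])  # g'-z' is the sum
outliers = np.array([[3.0, 3.0, 3.0, 9.0], [-3.0, -3.0, -3.0, -9.0]])
colours = np.vstack([locus, outliers])
is_star = confirm_stars(colours)
```

The pairwise distance matrix scales as N squared, which is acceptable for a per-field stellar candidate list but would need a tree-based neighbour search for much larger samples.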

An accurate spatially varying pixelised PSF model in each exposure of the lensing i'-band
data was created using the pure star catalogue for use in the lensing analysis. Each PSF
pixel value was modeled using a two-dimensional third-order polynomial function of
position in the camera field of view. To allow for the discontinuities in the PSF across
the boundaries between CCDs, some coefficients of the polynomial are allowed to vary
between CCDs (see Miller et al. 2012 for details). Note that the width of the CFHT i'-band
filter is sufficiently narrow that we can assume the PSF is independent of the star colour
(Cypriano et al. 2010; Voigt et al. 2012), which is shown to be a good assumption for the
i'-band in Guy et al. (2010). We also ignore the differing effects of atmospheric
dispersion on the PSF of stars and galaxies. The wavelength dependence of the refractive
index of the atmosphere will induce a spectrum dependent elongation of the object. For the
low airmass i'-band observations of the CFHTLS, however, the difference in this elongation
for stars and galaxies is shown to be negligible (Kaiser 2000). We use unweighted
quadrupole moment measures of the resulting high signal-to-noise pixelised PSF models to
calculate a measure of the PSF ellipticity e* at the location of each object (see equation
3 in Heymans et al. 2006a). These PSF ellipticity estimates are only used in the
systematics analysis detailed in Section 3.
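
An ellipticity estimate from unweighted quadrupole moments of a pixelised PSF model can be sketched as below. The cited equation is not reproduced in this text, so the normalisation used here, e = (Q11 - Q22 + 2i Q12) / (Q11 + Q22 + 2 sqrt(Q11 Q22 - Q12^2)), is an assumption: it is one widely used convention and may differ from the definition adopted in the analysis. The function name and the toy Gaussian PSF are illustrative.

```python
import numpy as np

def psf_ellipticity(image):
    """Complex ellipticity from unweighted quadrupole moments of a
    pixelised, high signal-to-noise PSF model (assumed convention)."""
    image = np.asarray(image, dtype=float)
    ny, nx = image.shape
    y, x = np.mgrid[0:ny, 0:nx]
    flux = image.sum()
    xc = (x * image).sum() / flux          # first moments give the centroid
    yc = (y * image).sum() / flux
    dx, dy = x - xc, y - yc
    q11 = (dx * dx * image).sum() / flux   # second moments about the centroid
    q22 = (dy * dy * image).sum() / flux
    q12 = (dx * dy * image).sum() / flux
    denom = q11 + q22 + 2.0 * np.sqrt(q11 * q22 - q12 ** 2)
    return complex(q11 - q22, 2.0 * q12) / denom

# Toy PSF: elliptical Gaussian with sigma_x = 3 and sigma_y = 2 pixels.
yy, xx = np.mgrid[0:65, 0:65]
psf = np.exp(-0.5 * (((xx - 32) / 3.0) ** 2 + ((yy - 32) / 2.0) ** 2))
e_star = psf_ellipticity(psf)
```

For this toy model the convention reduces to (a - b)/(a + b) for semi-axes a and b, so the expected value is close to 0.2 with a vanishing imaginary part. Unweighted moments are only practical here because the PSF model is effectively noise-free; on noisy star images they would diverge.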

The pure star catalogue is also used to measure accurate multi-band photometry by
constructing spatially varying kernels in u*g'r'i'z' which are used to gaussianise the PSF
in the mean co-added image of each band. This data manipulation is used only for
photometry measurements, and results in a circular Gaussian PSF that is constant between
the bands and across each field of view (see Hildebrandt et al. 2012 for more details on
PSF gaussianisation for photometry). Object colours are determined by running SEXTRACTOR
in dual-image mode on the homogeneous gaussianised u*g'r'i'z' images, and this photometry
is then analysed using the Bayesian Photometric Redshift Code (Benítez 2000, BPZ) with a
modified galaxy template set, stellar template set and a modified prior as detailed in
Hildebrandt et al. (2012).

Gaia: gaia.esa.int

So far we have discussed how to obtain a pure stellar catalogue but for our science goals
we also need a pure source galaxy catalogue, which again need not necessarily be complete.
To avoid any potential bias introduced by the manual star selection we use lensfit (Miller
et al. 2012) to measure shapes for every object detected with i'_AB < 24.7. The rationale
for this SEXTRACTOR MAG_AUTO measured magnitude limit is described below. Objects are fit
with galaxy and PSF models and any unresolved or stellar objects are assigned a weight of
zero in the final shape catalogue. The purity of the resulting galaxy catalogue is
confirmed by comparing with a photometry only analysis. We use BPZ to compare the
multi-band photometry of each object with galaxy and stellar templates and classify stars
and galaxies based on the maximum likelihood found for each object template and the size
of the object relative to the sizes of stars as defined in our initial stellar selection
(Hildebrandt et al. 2012). We find these two methods agree very well in creating a pure
galaxy sample with less than 1% of objects having a different object classification in
each method. These differences occur at faint magnitudes where the low weight in the
lensing analysis means that if these objects are truly stars, they would have a negligible
impact on the lensing signal.

The input pure galaxy catalogue is also required to be free of any shape-dependent
selection bias. This could arise, for example, if there is a preference to extract
galaxies oriented in the same direction as the PSF (Kaiser 2000) or preferentially extract
more circular galaxies such as those that are anti-correlated with the lensing shear
(Hirata & Seljak 2003). Heymans et al. (2006a) concluded that for the SEXTRACTOR algorithm
used in this analysis, selection bias was consistent with zero change in the mean
ellipticity of the population, and at worst introduced a very weak sub-percent level error
on the shear measurement. We therefore do not consider object detection selection bias any
further in our analysis.

The accuracy of the resulting photometric redshifts for the bright end of our pure galaxy
sample can be established by comparing photometric redshift estimates to spectroscopic
redshifts in the field (Hildebrandt et al. 2012). The public spectroscopic redshifts
available in our fields come from the Sloan Digital Sky Survey (SDSS, Abazajian et al.
2009), the VIMOS VLT deep survey (VVDS, Le Fèvre et al. 2005) and the DEEP2 galaxy
redshift survey (Newman et al. 2012). For these surveys, the spectroscopic completeness is
a strong function of magnitude with the faintest limits yielding ~ 100 spectra at i'_AB ~
24.7, according to our CFHTLenS magnitude estimates. We choose to limit all our shape and
photometric redshift analysis to a magnitude i'_AB ~ 24.7 even though the imaging data
permits one to detect and determine photometry and shapes for objects at slightly deeper
magnitudes. The rationale here is that all CFHTLenS science cases require accurate
photometric redshift information and we wish to avoid extrapolating the redshift
measurements into a regime with essentially no spectroscopic redshifts. That said, the
completeness of the spectroscopy degrades significantly for i'_AB > 22.5. We therefore
cannot trust that the accuracy predicted from a standard photometric-spectroscopic
redshift comparison at magnitudes i'_AB > 22.5 is representative of the full galaxy sample
at these magnitudes. In order to address this issue we implement a rigorous analysis to
test the accuracy of the photometric redshifts to the faint ~ 24.7 magnitude limit in
Benjamin et al. (2012). In this analysis we split the full galaxy sample into six
photometric redshift bins and compare the redshift distributions as determined from the
sum of the BPZ photometric redshift probability distributions, a cross-bin angular galaxy
clustering technique (Benjamin et al. 2010) and the COSMOS-30 redshifts (Ilbert et al.
2009). From this analysis we conclude that, when limited to the photometric redshift range
0.2 < z < 1.3, our photometric redshift error distribution is sufficiently well
characterized by the measured redshift probability distributions, for the main science
goals of the survey (Benjamin et al. 2012). We can therefore incorporate the redshift
probability distributions in scientific analyses of CFHTLenS in order to account for
catastrophic outliers and other redshift errors. Finally, with a magnitude limit fainter
than i'_AB = 24.7 objects are detected in the lensing i'-band at signal-to-noise ratios
less than 7σ which is a regime where even the best shape measurement methods become very
strongly biased (Kitching et al. 2012a).
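
Estimating a redshift distribution from the sum of per-galaxy redshift probability distributions can be sketched as follows. This is an illustrative toy, not the procedure of Benjamin et al. (2012): the function name `stacked_nz`, the optional per-galaxy weights, the common redshift grid and the two example probability distributions are all assumptions.

```python
import numpy as np

def stacked_nz(pz, weights=None):
    """Sum per-galaxy P(z), sampled on a common redshift grid, into a
    redshift distribution normalised to unit sum. `weights` is an
    optional per-galaxy weight (an assumption of this sketch)."""
    pz = np.asarray(pz, dtype=float)
    if weights is None:
        weights = np.ones(pz.shape[0])
    nz = (np.asarray(weights, dtype=float)[:, None] * pz).sum(axis=0)
    return nz / nz.sum()

z = np.linspace(0.0, 3.0, 301)

def peak(z0, sigma):
    """Unit-sum Gaussian P(z) on the grid z."""
    p = np.exp(-0.5 * ((z - z0) / sigma) ** 2)
    return p / p.sum()

# One well-constrained galaxy and one with a secondary high-redshift peak.
pz = np.array([
    peak(0.5, 0.05),
    0.8 * peak(0.6, 0.05) + 0.2 * peak(2.5, 0.1),
])
nz = stacked_nz(pz)
high_z_fraction = nz[z > 2.0].sum()
```

Unlike a histogram of best-fit redshifts, the stacked distribution keeps the probability assigned to secondary peaks, which is how catastrophic outliers enter the estimated distribution: in this toy, one tenth of the total probability sits above z = 2.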

2.4 Shape Measurement Method Selection 

In the early stages of the analysis, the CFHTLenS team tested a variety of different shape
measurement methods including two different versions of KSB+ (most recently used in
Schrabback et al. 2010; Fu et al. 2008), three different versions of Shapelets (Kuijken
2006; Massey et al. 2007c; Velander et al. 2011) and the model fitting method lensfit
(Miller et al. 2007; Kitching et al. 2008; Miller et al. 2012). Out of all the measurement
algorithms tested by CFHTLenS, only lensfit had the capability to simultaneously analyse
data on an individual exposure level rather than on higher signal-to-noise stacked data
where the PSF of the stack is a complex combination of PSFs from different regions of the
camera. During the early multi-method comparison analyses, it became apparent how
important this lensfit capability was. In principle, all other tested algorithms could
have also operated on individual exposures and averaged the results on a catalogue level,
as advocated by Bernstein & Jarvis (2002). As these methods apply signal-to-noise cuts,
however (see for example Table A1 in Heymans et al. 2006a), this would have yielded a low
galaxy number density in comparison to the stacked data analysis. The signal-to-noise is
typically decreased by roughly 40% on the exposure image compared to the stack.
Additionally, all shape measurement methods are expected to be subject to noise bias which
increases as the signal-to-noise of the object decreases (Melchior & Viola 2012).
Averaging catalogues measured from individual lower signal-to-noise exposures is therefore
expected to be more biased than a single pass higher signal-to-noise simultaneous analysis
of the exposures, as carried out by lensfit.

We initially focused on finding a method which did not pro- 
duce the significant variations of the large-scale shear signal be- 
tween individual MegaCam pointings as discussed in [Kilbingerj 
[et al.|p009| . We rapidly came to the conclusion that only the model 
fitting lensfit analysis could produce a robust result. In the case of 
KSB-I-, the PSF is assumed to be a small but highly anisotropic 
distortion convolved with a large circularly symmetric seeing disk 
( [Kaiser etal.|1995| l. In the CFHTLenS stacks the PSF does not meet 
these assumptions so it is not surprising that the KSB+ method fails 
on the stacked data, where the PSF varies significantly between ex- 
posures. In addition some of the CFHT PSF distortion arises from 
coma which also cannot be modeled by the assumed KSB+ PSF 
profile. For the SHAPELETS methods we found the signal-to-noise
of the shear measurements to be very low in comparison to the
other shape measurement methods tested. This is in contrast to a
successful application of the SHAPELETS method to space-based
observations where the higher order moment analysis is able to
take advantage of the additional resolution (Velander et al. 2011).
The Kuijken (2006) SHAPELETS method is, however, used in the
CFHTLenS analysis for gaussianising the PSFs for optimal
photometry (see Section 2.3 and Hildebrandt et al. 2012). In the case
of multi-band photometry the gain in computational speed that
results from the analytical convolutions that are possible within the
SHAPELETS formalism means using SHAPELETS for gaussianisation
is preferable to using the slower, but potentially more exact,
pixel based model of the PSF as used by lensfit. Work is ongoing
to see whether the SHAPELETS PSF gaussianisation method can
improve the performance of KSB+.

It is natural to ask why systematic errors have not been ap- 
parent when the methods CFHTLenS tested have previously been 
tested in the following blind image simulation analysis challenges: 
Heymans et al. (2006a); Massey et al. (2007a); Bridle et al. (2010);
Kitching et al. (2012a,b). This is particularly relevant to ask as KSB+
consistently performs well in these challenges. The answer to this
lies in the fact that the errors discussed above are a consequence of 
features in the data that have not been present in the image simu- 
lation challenges. These challenges do not contain astrometric dis- 
tortions and either have a constant PSF, or known PSF models and 
stellar locations, with typically low PSF ellipticities. It is interest- 
ing to note that the only shape measurement challenge to simulate 
the relatively strong 10%-level PSF distortions that are typical in 
CFHT MegaCam imaging found significant errors for KSB+ in this
regime (Massey et al. 2007a). Finally, low-signal-to-noise multiple
dithered exposures have only recently been simulated and tested
for use in combination in Miller et al. (2012), in addition to
Kitching et al. (2012a) which presented the first deep multi-epoch image
simulations of non-dithered exposures. These new simulations are 
the first to test the difficulty of optimally co-adding exposures in 
analyses of multiple exposure data such as CFHTLenS. Even with 
this advance, features of multiple exposure data such as gaps and 
discontinuities in coverage and the requirement for interpolation 
of data with an astrometric distortion remain to be tested in future 
image simulation challenges, in addition to a more realistic set of 
galaxy models.^

2.5 Shape Measurement with lensfit

CFHTLenS is the first weak lensing survey to apply the lensfit 
model fitting method and as such there have been many key de- 
velopments of the algorithm for CFHTLenS. The method performs 
a Bayesian model fit to the data, varying galaxy ellipticity and size 
and marginalising over centroid position. It uses a forward convolu- 
tion process, convolving the galaxy models with the PSF to calcu- 
late the posterior probability of the model given the data. A galaxy 
is then assigned an ellipticity, or shear estimate, e, estimated from 
the mean likelihood of the model posterior probability, marginal- 
ized over galaxy size, centroid and bulge fraction. An inverse vari- 
ance weight w is also assigned which is given by the variance of 
the ellipticity likelihood surface and the variance of the
ellipticity distribution of the galaxy population (see Miller et al. 2012
for more details). A summary of the CFHTLenS improvements to
the algorithm includes using two-component galaxy models, a new
size prior derived from high resolution Hubble Space Telescope
data, a new ellipticity prior derived from well resolved galaxies in
the SDSS, the application of the astrometric distortion correction
to the galaxy model rather than re-pixelising the data, and the
simultaneous joint analysis of single dithered exposures rather than
the analysis of a stack. These developments are presented in Miller
et al. (2012) along with details of the verification of calibration
requirements as determined from the analysis of a new suite of image
simulations of dithered low-signal-to-noise exposures.

^ See for example the GREAT3 challenge: www.great3challenge.info



2.6 Summary 

Once the THELI data analysis, object selection, redshift estimation 
with BPZ and shear estimation with lensfit are complete, we ob- 
tain a galaxy catalogue containing a shear measurement with an in- 
verse variance weight w and a photometric redshift estimate with a 
probability distribution P(z). The number density of galaxies with
shear and redshift data is 17 galaxies per square arcmin. The ef- 
fective weighted galaxy number density that is useful for a lensing 
analysis is given by 



$$ n_{\rm eff} = \frac{1}{\Omega} \frac{\left( \sum_i w_i \right)^2}{\sum_i w_i^2} , \tag{1} $$



where $\Omega$ is the total area of the survey excluding masked regions,
and the sum over weights $w_i$ is taken over all galaxies in the survey.
We find $n_{\rm eff} = 14$ galaxies per square arcmin for the full sample.
We choose, however, to use only those galaxies in our analyses with
a photometric redshift estimate between 0.2 < z < 1.3 (see Section
2.3 and Benjamin et al. 2012). The deep imaging results in a
weighted mean redshift for this sample of $\bar{z} = 0.75$, and a weighted
median redshift for this sample of $z_{\rm m} = 0.7$, as determined from
the weighted sum of the P(z). The effective weighted galaxy number
density, in this redshift range, is $n_{\rm eff} = 11$ galaxies per square
arcmin. This photometric redshift selection ensures relatively
accurate photometric redshifts across the survey with an average
scatter^ $\sigma_z \sim 0.04(1 + z)$ and an average catastrophic outlier rate
below 4%. Across the selected redshift range, the scatter is always
less than $0.055(1 + z)$ and the catastrophic outlier rate is always
less than 11% (Hildebrandt et al. 2012). As detailed in Miller
et al. (2012), and discussed further in Section 4.3, any calibration
corrections applied to the shear measurement are less than 6% on
average. The task is now to verify the quality of these catalogues
and select the data that meets the criterion of negligible systematic 
errors for a cosmic shear analysis. The resulting error analysis and 
field selection is also directly relevant for galaxy, group and cluster 
lensing analyses of dark matter haloes. The requirements on sys- 
tematics for these analyses, however, are typically less stringent as 
a result of the azimuthal averaging of the lensing signal which re- 
duces the impact of any PSF residuals. We dedicate the rest of the 
paper to the derivation, application and validation of a set of sys- 
tematics criteria to the CFHTLenS data. 
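
Returning to equation (1), the effective number density is simple to evaluate from a weight catalogue. The following is a minimal sketch only: the function name, the plain-list input and the choice of square arcmin for the area are illustrative assumptions, not part of the CFHTLenS pipeline.

```python
def effective_number_density(weights, area_arcmin2):
    """Effective weighted number density, (sum w)^2 / (area * sum w^2).

    weights      : iterable of per-galaxy weights w_i
    area_arcmin2 : unmasked survey area in square arcmin (assumed unit)
    """
    weights = list(weights)
    sum_w = sum(weights)
    sum_w2 = sum(w * w for w in weights)
    if area_arcmin2 <= 0 or sum_w2 == 0:
        raise ValueError("need a positive area and at least one non-zero weight")
    return sum_w * sum_w / (area_arcmin2 * sum_w2)


if __name__ == "__main__":
    # Equal weights recover the raw number density; unequal weights lower it.
    print(effective_number_density([1.0] * 17, 1.0))
    print(effective_number_density([2.0, 1.0, 1.0, 0.5], 1.0))
```

With equal weights the estimator returns the raw number density, so the drop from 17 to 14 galaxies per square arcmin quoted above reflects the spread in the weights.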

To conclude this section we refer the reader back to Table 1
which compares the differences between the key stages in the lensing
analysis pipeline described above and the pipeline used in an
earlier CFHTLS analysis by Fu et al. (2008), illustrating how every
core stage of the pipeline has been re-written for CFHTLenS. This
was necessary as every stage of the standard pipeline used in
previous analyses could have introduced low-level systematic errors,
with the most important errors coming from the analysis of median
stacked data in comparison to individual exposures.

^ The scatter, $\sigma_z$, on the photometric redshifts, $z_{\rm phot}$, is calculated from
a comparison with the VVDS and DEEP2 spectroscopic redshifts, $z_{\rm spec}$.
The quoted $\sigma_z$ is given by the standard deviation around the mean of
$\Delta z = (z_{\rm phot} - z_{\rm spec})/(1 + z_{\rm spec})$, after outliers with $|\Delta z| > 0.15$ are removed.
See Hildebrandt et al. (2012) for further details.



3 METHODS: QUANTITATIVE SYSTEMATIC ERROR 
ANALYSIS 

The observational measurement of weak gravitational lensing is 
a challenging task, with the cosmological shear signal $\gamma$ that we
wish to extract being roughly an order of magnitude below the
atmospheric and telescope distortion. These artificial sources of
distortion are encapsulated in the PSF. Astrometric camera shear
distortion is then applied after the PSF convolution (see Miller et al.
2012 for further discussion on these different types of distortions).
We detail the lensfit shear measurement method in Miller et al.
(2012) and verify and calibrate the robustness of the method on
an extensive suite of realistic image simulations. Whilst this suc- 
cessful demonstration on image simulations is very necessary, it is, 
however, not sufficient to then conclude the method will also yield 
an unbiased measure in its application to data. An example failure 
could result from any data-related feature not included in the image 
simulations that would result, for example, in an inaccuracy in the 
PSF model (Hoekstra 2004; Van Waerbeke et al. 2005; Rowe 2010;
Heymans et al. 2012). In this section we therefore develop a
procedure to determine the level of any residual distortions in the shape
catalogues which result from an incomplete correction for the true 
PSF. In order to distinguish any residual distortions from random 
noise correlations we need to construct an estimator that takes into 
account the different sources of noise in our analysis for which we 
require and develop a set of realistic simulated mock catalogues. 

Throughout the paper when we refer to galaxy ellipticity $e$,
galaxy shear $\gamma$, PSF ellipticity $e^{*}$ or noise $\eta$ on any of these shape
measures, we consider a complex ellipticity quantity composed of
two components, for example $e = e_1 + i e_2$. For a perfect ellipse
with an axial ratio $\beta$ and orientation $\phi$, measured counter clockwise
from the horizontal axis, ellipticity parameters are given by

$$ \begin{pmatrix} e_1 \\ e_2 \end{pmatrix} = \frac{\beta - 1}{\beta + 1} \begin{pmatrix} \cos 2\phi \\ \sin 2\phi \end{pmatrix} . \tag{2} $$
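
As a small illustration of equation (2), the complex ellipticity of a perfect ellipse can be written as a modulus set by the axial ratio times a phase of twice the orientation angle. The function name below is hypothetical and the sketch simply evaluates that expression.

```python
import cmath
import math


def ellipticity_from_axes(beta: float, phi: float) -> complex:
    """Complex ellipticity e = e1 + i*e2 of a perfect ellipse (equation 2).

    beta : axial ratio of the ellipse
    phi  : orientation in radians, counter clockwise from the horizontal axis
    """
    modulus = (beta - 1.0) / (beta + 1.0)
    # The factor of two makes the ellipticity invariant under a rotation by pi.
    return modulus * cmath.exp(2j * phi)


if __name__ == "__main__":
    e = ellipticity_from_axes(beta=3.0, phi=math.pi / 4)
    print(f"e1 = {e.real:.3f}, e2 = {e.imag:.3f}")
```

An ellipse aligned with the horizontal axis has a purely real ellipticity, while one at 45 degrees has a purely imaginary one, which is why the two components are often described as the '+' and 'x' polarisations.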



As we focus this section on systematics that are related to the PSF, 
we construct a general model for shear measurement with a system- 
atic error term that is linearly proportional to the PSF ellipticity, as 
first proposed by Bacon et al. (2003). We use the following model
to determine the level of any residual distortions which would result
from an incomplete correction for the PSF

$$ e^{\rm obs} = e^{\rm int} + \gamma + \eta + A_{\rm sys}^{T} e^{*} . \tag{3} $$



Here $e^{\rm obs}$ is the observed shear estimator, $e^{\rm int}$ is the intrinsic galaxy
ellipticity, $\gamma$ is the true cosmological shear that we wish to detect,
and $\eta$ is the random noise on the shear measurement whose
amplitude depends on the size and shape of the galaxy in addition to
the signal-to-noise of the observations. An optimal method applied
to an optimal survey will yield a random noise distribution that is
significantly narrower than the intrinsic ellipticity distribution with
$\sigma_\eta \ll \sigma_{e^{\rm int}}$ in the typical signal-to-noise regime of the data. The
systematic error term in equation (3) is given by $A_{\rm sys}^{T} e^{*}$. Here $e^{*}$ is
a complex $N$ dimensional vector of PSF ellipticity at the position
of the galaxy in each of the $N$ dithered exposures of the field. In
the case where the galaxy is not imaged in a particular exposure,
as a result of differing chip gap and edge positions in the dithered
exposures, we set the relevant exposure component of $e^{*}$ equal to
zero. $A_{\rm sys}$ is the amplitude of the true systematic PSF
contamination which we construct as a vector of length $N$ containing the
average fraction of the PSF ellipticity that is residual in the shear
estimate in each individual exposure.^ For an unbiased shear
estimate $A_{\rm sys}^{T} e^{*} = 0$.
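
The linear model of equation (3) can be sketched for a single galaxy as follows. This is an illustration under stated assumptions rather than the lensfit implementation: the function name, the argument names and the boolean `imaged` mask are hypothetical, and NumPy is assumed to be available.

```python
import numpy as np


def observed_ellipticity(e_int, gamma, eta, a_sys, e_psf, imaged=None):
    """Equation (3): e_obs = e_int + gamma + eta + A_sys^T e_psf.

    e_int, gamma, eta : complex scalars for one galaxy
    a_sys  : real array of length N, PSF leakage fraction per exposure
    e_psf  : complex array of length N, PSF ellipticity at the galaxy position
    imaged : optional boolean array of length N; False marks exposures in
             which the galaxy falls in a chip gap or off the edge
    """
    a_sys = np.asarray(a_sys, dtype=float)
    e_psf = np.asarray(e_psf, dtype=complex)
    if imaged is not None:
        # Exposures that do not image the galaxy contribute e_psf = 0.
        e_psf = np.where(np.asarray(imaged, dtype=bool), e_psf, 0.0)
    return e_int + gamma + eta + a_sys @ e_psf


if __name__ == "__main__":
    # A 10 per cent leakage of a 0.1 PSF ellipticity in one of two exposures.
    print(observed_ellipticity(0.0, 0.0, 0.0, [0.1, 0.0], [0.1, 0.05j]))
```

Setting every component of `a_sys` to zero recovers the unbiased case in which the systematic term vanishes.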

We consider a series of different two point correlation
functions $\xi_\pm$ using the shorthand notation $\langle ab \rangle$ to indicate which two
ellipticity components $a$ and $b$ are being correlated using the
following data estimator:

$$ \xi_\pm(\theta) = \langle e e \rangle = \frac{\sum w_i w_j \left[ e_t(x_i) e_t(x_j) \pm e_\times(x_i) e_\times(x_j) \right]}{\sum w_i w_j} , \tag{4} $$

where in this particular example we are correlating the observed 
galaxy ellipticities, and the weighted sum is taken over galaxy pairs 
with angular separation $|x_i - x_j| = \theta$. The tangential and cross
ellipticity parameters $e_{t,\times}$ are the ellipticity parameters in
equation (2) rotated into the reference frame joining each pair of
correlated objects. In the derivation that follows we will use this
shorthand notation to indicate the correlation between galaxy ellipticity,
ellipticity measurement noise, PSF ellipticity and shear following
the same construction of the estimator in equation (4). We base our
systematics analysis on this type of statistic as in the case of the
two point shear correlation $\langle \gamma\gamma \rangle$ it can be directly related to the
underlying matter power spectrum that we wish to probe with weak
gravitational lensing,

$$ \langle \gamma\gamma \rangle_\theta = \frac{1}{2\pi} \int d\ell \, \ell \, P_\kappa(\ell) \, J_\pm(\ell\theta) , \tag{5} $$

where $J_\pm(\ell\theta)$ is the zeroth (for $\xi_+$) and fourth (for $\xi_-$) order Bessel
function of the first kind and $P_\kappa(\ell)$ is the convergence power
spectrum at angular wave number $\ell$ (see Bartelmann & Schneider 2001
for more details). In the case where there are no systematic errors
and no intrinsic alignment of nearby galaxies, equation (4) is an
accurate estimate of the right hand side of equation (5) (see the
discussion in Heymans et al. 2006b; Joachimi et al. 2011 and
references therein).
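
The pair estimator of equation (4) can be sketched with a brute-force loop over all galaxy pairs in a single angular bin. This is an illustrative sketch, not the code used for CFHTLenS: the function name, the flat-sky complex-position convention and the sign convention for the tangential and cross components are assumptions, and a production analysis would use a tree code rather than an O(n^2) pair count.

```python
import numpy as np


def xi_pm(pos, e, w, theta_min, theta_max):
    """Brute-force estimate of xi_+ and xi_- (equation 4) in one angular bin.

    pos : complex array, positions x + i*y (flat-sky approximation)
    e   : complex array, ellipticities e1 + i*e2
    w   : real array, weights
    """
    pos = np.asarray(pos, dtype=complex)
    e = np.asarray(e, dtype=complex)
    w = np.asarray(w, dtype=float)

    i, j = np.triu_indices(len(pos), k=1)  # each pair counted once
    sep = pos[j] - pos[i]
    keep = (np.abs(sep) >= theta_min) & (np.abs(sep) < theta_max)
    i, j, sep = i[keep], j[keep], sep[keep]

    # Rotate both ellipticities into the frame joining the pair.
    # With e_rot = -e * exp(-2i*phi): e_t = Re(e_rot), e_x = Im(e_rot).
    rot = -np.exp(-2j * np.angle(sep))
    ei, ej = e[i] * rot, e[j] * rot

    ww = w[i] * w[j]
    norm = ww.sum()
    if norm == 0:
        raise ValueError("no weighted pairs in this angular bin")
    tt = np.sum(ww * ei.real * ej.real)
    xx = np.sum(ww * ei.imag * ej.imag)
    return (tt + xx) / norm, (tt - xx) / norm
```

Because the '+' combination equals the real part of one ellipticity times the complex conjugate of the other, it does not depend on the pair orientation, whereas the '-' combination does.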



3.1 Cosmological Simulations: CFHTLenS clone 

In the analysis that follows we quantify the significance of any 
residual systematic error in the data by comparing our results with 
a cosmological simulation of CFHTLenS that we hereafter refer to 
as the 'clone'. The core input of the 'clone' comes from 184 fully 
independent three-dimensional N-body numerical lensing simulations,
where light cones are formed from line of sight integration
through independent dark matter particle simulations, without
rotation (Harnois-Deraps et al. 2012). The simulated cosmology
matches the 5-year Wilkinson Microwave Anisotropy Probe
(WMAP5) flat $\Lambda$CDM cosmology constraints from Dunkley et al.
(2009) and we adopt this cosmology where necessary throughout
the paper, noting that our results are insensitive to the cosmological
model that we choose. Each high resolution simulation has a real
space resolution of 0.2 arcmin in the shear field and spans 12.84
square degrees sampled at 26 redshift slices between 0 < z < 3.
The two-point shear statistics measured in real-space from the
simulations closely match the input, dark matter only, theory from
0.5 < $\theta$ < 40 arcmin scales at all redshifts (Harnois-Deraps et al.
2012). Being able to recover small-scale real-space resolution of
the simulated shear field is crucial for our comparison analysis of
systematic errors on these angular scales.

^ For a single exposure image, such that $N = 1$, a circular, unsheared
galaxy measured with infinite signal-to-noise would have an observed
ellipticity $e^{\rm obs} = A_{\rm sys} e^{*}$, where the exact value of $A_{\rm sys}$ depends on how
well the PSF correction has performed. A measurement of $A_{\rm sys}$ from the
data can determine what fraction of the PSF ellipticity contaminates the
final shear estimate.

We use each independent line of sight in the simulation to cre- 
ate different cosmological realizations of each MegaCam field in 
CFHTLenS. We ensure that the galaxy distribution, survey masks 
and redshifts from the data are exactly matched in the simulations, 
and assign galaxy shear $\gamma$ from the lensing simulations by linearly
interpolating between the fine redshift slices in the simulations to
get a continuous redshift distribution. The sum of the noise $\eta$ and
intrinsic ellipticity distribution $e^{\rm int}$ are assigned on a galaxy by
galaxy basis by randomizing the corresponding measured galaxy
orientation in the data, such that $|e^{\rm int} + \eta| = |e^{\rm obs}|$. This step
assumes that, on average, the true shear $\gamma$ contribution to the observed
ellipticity $e^{\rm obs}$ is small in comparison with the measurement noise
$\eta$ and intrinsic ellipticity distribution $e^{\rm int}$. Finally, each galaxy in
the 'clone' is assigned a corresponding PSF ellipticity, for each
exposure. This is given by the PSF model ellipticity $e^{*}$, as
measured from the data, at the location of the galaxy whose position
in the MegaCam field matches the simulated galaxy position in the 
'clone'. 
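
The orientation randomization used to assign the noise plus intrinsic ellipticity term in the 'clone' can be sketched as a random phase rotation that preserves the ellipticity modulus. The function name is hypothetical and the use of a NumPy random generator is an assumption made for illustration.

```python
import numpy as np


def randomize_orientation(e_obs, rng):
    """Rotate each observed ellipticity by a random angle, keeping |e| fixed."""
    e_obs = np.asarray(e_obs, dtype=complex)
    # Rotating a galaxy by psi changes the ellipticity phase by 2*psi,
    # so psi drawn uniformly from [0, pi) samples every orientation once.
    psi = rng.uniform(0.0, np.pi, size=e_obs.shape)
    return e_obs * np.exp(2j * psi)


if __name__ == "__main__":
    rng = np.random.default_rng(42)
    e_data = np.array([0.30 + 0.10j, -0.20j, 0.05 + 0.00j])
    e_mock = randomize_orientation(e_data, rng)
    print(np.abs(e_data), np.abs(e_mock))
```

Each call with a fresh random state gives one noise realization with the same ellipticity moduli as the data but no remaining correlation with the PSF pattern.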



3.2 The star-galaxy cross correlation function 

In order to assess the level and significance of PSF-related sys- 
tematics in the data we measure the two-point star-galaxy cross 
correlation function $\xi_{\rm sg} = \langle e^{\rm obs} e^{*} \rangle$ which, using our linear shear
measurement model (equation 3), can be written as

$$ \xi_{\rm sg} = \langle e^{\rm obs} e^{*} \rangle = \langle e^{\rm int} e^{*} \rangle + \langle \gamma \, e^{*} \rangle + \langle \eta \, e^{*} \rangle + C A_{\rm sys} . \tag{6} $$

$C$ here is given by the covariance matrix of PSF ellipticities
between exposures such that $C_{ij} = \langle e^{*}_i e^{*}_j \rangle$ with $i$ and $j$ denoting the
different $N$ exposures. We assume that $A_{\rm sys}$, the changing fraction
of PSF contamination in each exposure, does not vary across the
field of view.^

The derivation that follows in Section 3.3 is general for both
the $\xi_+$ and $\xi_-$ components of each correlation function and
applies to any angular separation probed $\theta$, and both a model and an
observed stellar measurement of the PSF ellipticity. Our primary
systematics analysis however inspects only the zero-lag star-galaxy
correlation $\xi_{\rm sg}(\theta = 0)$, hereafter $\xi_{\rm sg}(0)$, using the model of the
PSF ellipticity to determine $e^{*}$ at the location of each galaxy. At
zero-lag the estimator in equation (4) for the star-galaxy correlation
reduces to

$$ \xi^{\rm sg}_\pm(0) = \frac{\sum_i w_i \left[ e_1(x_i) e^{*}_1(x_i) \pm e_2(x_i) e^{*}_2(x_i) \right]}{\sum_i w_i} . \tag{7} $$

The motivation for this is as follows: consider a data set where the
PSF model and correction is exact such that observed galaxy
ellipticities are uncorrelated with the PSF and $\xi_{\rm sg}(0)$ is consistent
with zero. If we cross-correlate the same galaxy ellipticities with
the PSF ellipticity at some distance $\theta$, $\xi_{\rm sg}(\theta)$ will continue to be
consistent with zero because the galaxies have intrinsically random
orientations. Instead, now consider a data set where there has been
an error in the measurement of the PSF model or an error in the
PSF model correction. In this case the star-galaxy cross correlation
at the location of the galaxies $\xi_{\rm sg}(0)$ is now non-zero. At larger
separations however, $\xi_{\rm sg}(\theta)$ may be zero or non-zero depending on the
variation of the PSF autocorrelation function. Hence, for the
detection of systematics by star-galaxy correlation, we argue that there
is little information to be gained from measurements of $\xi_{\rm sg}(\theta > 0)$
as it is an error in the local PSF model or local PSF
correction that creates this form of systematic error.

^ If $A_{\rm sys}$ were dependent on position in the image or galaxy properties, for
example, the method we are proposing to isolate PSF contamination would
be sensitive to the average value $\langle A_{\rm sys} \rangle$.

In the presence of systematics we would expect to detect a sig- 
nal in the '+' component of the zero-lag star-galaxy cross correla- 
tion. For systematics that are dependent on the ellipticity direction 
we would also expect to detect a signal in the '-' component (see
equation 7). The ellipticity direction dependence of any PSF
residuals is, however, expected to be weak, which we confirm in
Section 5. Our zero-lag systematic error analysis that follows therefore
focuses on the '+' component only. We return to the '-' component
of the two-point correlation function in Section 5.
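
A zero-lag star-galaxy correlation of this kind can be sketched as a weighted mean, over galaxies, of the product of the galaxy ellipticity and the PSF model ellipticity at the same position. The sketch below assumes the estimator takes the form of a weighted average of $e_1 e^{*}_1 \pm e_2 e^{*}_2$; the function name and array layout are hypothetical.

```python
import numpy as np


def xi_sg_zero_lag(e_obs, e_psf, w):
    """Zero-lag star-galaxy correlation per exposure (assumed estimator form).

    e_obs : complex array (n_gal,), observed galaxy ellipticities
    e_psf : complex array (n_gal, n_exp), PSF model ellipticity at each
            galaxy position in each exposure (zero where not imaged)
    w     : real array (n_gal,), galaxy weights
    Returns (xi_plus, xi_minus), each a real array of length n_exp.
    """
    e_obs = np.asarray(e_obs, dtype=complex)[:, None]
    e_psf = np.asarray(e_psf, dtype=complex)
    w = np.asarray(w, dtype=float)[:, None]
    c11 = e_obs.real * e_psf.real  # e_1 e*_1 for every galaxy and exposure
    c22 = e_obs.imag * e_psf.imag  # e_2 e*_2 for every galaxy and exposure
    norm = w.sum()
    xi_plus = (w * (c11 + c22)).sum(axis=0) / norm
    xi_minus = (w * (c11 - c22)).sum(axis=0) / norm
    return xi_plus, xi_minus
```

Only a single pass over the catalogue is needed, in contrast to the pair counting required at non-zero separation.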

To add further to our argument, with a measure of the zero-lag
star-galaxy correlation $\xi_{\rm sg}(0)$ we can use equation (6) to make a
prediction of the star-galaxy correlation at any angular scale^ using

$$ \xi_{\rm sg}(\theta) \approx C_0^{-1} \, \xi_{\rm sg}(0) \, C_\theta , \tag{8} $$



where $C_0$ is the measured covariance matrix of PSF ellipticities
between exposures at zero-lag and $C_\theta$ is the same PSF
measurement but for sources at separation $\theta$. Figure 1 demonstrates this
by comparing the predicted signal (equation 8, shown as a curve)
with the star-galaxy cross correlation function $\xi_{\rm sg}(\theta)$ (shown as
triangles) measured in the eight individual exposures in example field
W1m0m0. Seven of the exposures were imaged consecutively. This
field is typical of the sample of data that passes our systematics tests
in Section 4.2. Note that we use a scalar symbol here as we are
referring to the measurement in each exposure rather than the
vector which contains the measurement across all exposures. The zero
separation measure for each exposure $\xi_{\rm sg}(0)$ is shown offset in each
panel (circle). The correlation between the exposures and angular 
scales is shown in the covariance matrix in the upper panel to warn 
the reader that 'chi-by-eye' of this data will fail. Each block shows 
one of the eight exposures and contains a 6 x 6 matrix showing 
the degree of correlation between the 6 measured angular scales. 
As shown in the side greyscale panel, the amplitude of the matrix 
is small making it sensitive to measurement errors and in order to 
estimate a stable covariance matrix we require a very computation- 
ally expensive bootstrap analysis of the data. We have therefore 
only made a detailed comparison on 10% of our fields, performing 
a goodness of fit test to the data over a range of angular scales 
using the model prediction from the zero-lag star-galaxy cross
correlation function in equation 8. In all cases we find the prediction
is a reasonable model for the measured star-galaxy correlation. We 
also repeated the analysis for our sample fields using the measured 
stellar object ellipticities in contrast to the model PSF ellipticity. 



^ Note that for a single exposure image, equation (8) reduces to
$\xi_{\rm sg}(\theta_{ab}) \approx \xi_{\rm sg}(0) \, \langle e^{*}_a e^{*}_b \rangle / \langle e^{*2} \rangle$ where $a$ and $b$ here indicate objects separated by a
distance $\theta_{ab}$. This relationship assumes that the amplitude and angular
variation of the first three terms on the right hand side of equation (6) is small
in comparison to the amplitude and angular variation of the star-star
auto-correlation function.
Figure 1. The star-galaxy cross correlation function $\xi_{\rm sg}(\theta)$ for the eight
individual exposures in example field W1m0m0 as a function of angular
separation (triangles, where each panel is a different exposure). The
measured angular correlation function in each exposure can be compared to
the predicted angular star-galaxy correlation (equation 8, shown as a curve)
calculated using only the zero separation measure $\xi_{\rm sg}(0)$ (shown offset,
circle). The correlation between the exposures and angular scales is shown
in the covariance matrix of the data points in the upper right panel. Each
block shows one of the eight exposures and contains a 6 x 6 matrix
showing the correlation between the angular scales. The greyscale bar shows the
amplitude of the values in the matrix.



Whilst our measurement errors increased, our findings were un- 
changed such that for the remainder of our systematics analysis we 
conclude that we can safely consider only the zero-lag star-galaxy 
cross correlation function $\xi_{\rm sg}(0)$ as calculated using the model PSF
ellipticity. 
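
The prediction of the star-galaxy correlation at non-zero separation from the zero-lag measurement (equation 8) can be sketched with a linear solve. The ordering of the matrix products is an assumption here: the sketch first infers an effective leakage vector from the zero-lag covariance and then applies the covariance at separation $\theta$, which for symmetric covariance matrices matches the row-vector form in which equation (8) is printed. The function and argument names are hypothetical.

```python
import numpy as np


def predict_xi_sg(xi_sg_0, c_0, c_theta):
    """Predict xi_sg(theta) from the zero-lag measurement.

    xi_sg_0 : array (N,), zero-lag star-galaxy correlation per exposure
    c_0     : array (N, N), PSF covariance between exposures at zero lag
    c_theta : array (N, N), the same quantity at separation theta
    """
    xi_sg_0 = np.asarray(xi_sg_0, dtype=float)
    c_0 = np.asarray(c_0, dtype=float)
    c_theta = np.asarray(c_theta, dtype=float)
    # Effective leakage implied by xi_sg(0) = C_0 A when noise terms vanish.
    a_eff = np.linalg.solve(c_0, xi_sg_0)
    return c_theta @ a_eff
```

For a single exposure the matrices are scalars and the prediction reduces to the zero-lag signal scaled by the ratio of the PSF autocorrelation at separation $\theta$ to its zero-lag value.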

3.3 Estimating the level of PSF anisotropy contamination 

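Assuming the linear shear measurement model of equation (3) is a
good description of the systematics within the data, the systematic
error contribution to the cosmological measure of the two-point
shear correlation function $\xi_\pm = \langle e^{\rm obs} e^{\rm obs} \rangle$ is given by

$$ \Delta\xi_{\rm sys} = A_{\rm sys}^{T} \, C \, A_{\rm sys} , \tag{9} $$

which can be estimated from the data via^

$$ \Delta\xi_{\rm obs} = \xi_{\rm sg}^{T} \, C^{-1} \, \xi_{\rm sg} . \tag{10} $$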



When calculating $\Delta\xi_{\rm obs}$ from a very large area of data, such that
the PSF is fully uncorrelated with the intrinsic ellipticity,
measurement noise and cosmological shear, the first three terms in the right
hand side of equation (6) are zero and $\Delta\xi_{\rm obs} = \Delta\xi_{\rm sys}$. In this case
the PSF correction is deemed successful when $\Delta\xi_{\rm obs}$ is found to
be consistent with zero. This method for data verification has been
applied to many previous weak lensing surveys (see for example
Bacon et al. 2003) but only in an ensemble average across the full
survey area and for single stacked images. By taking an ensemble
average of $\Delta\xi_{\rm obs}$ across the survey, one explicitly assumes that the
true level of PSF contamination that we wish to estimate is
independent of the variations in the quality of the data. For ground-based
observations where the data quality varies considerably we might
expect our ability to remove the PSF to be reduced in some
particular instances, for example poorer seeing or low signal-to-noise data.
By determining $\Delta\xi_{\rm obs}$ averaged across the survey we could easily
miss a small fraction of the data which exhibit a strong PSF
residual. In the worst case scenario, as the CFHTLS PSF exhibits strong
variation in direction and amplitude between exposures, PSF
residual effects could easily cancel out in an ensemble average (see
Section 5.2 for further discussion on this point). We therefore choose to
apply this methodology to individual one square degree MegaCam
fields (hereafter referred to as a field), in order to identify fields
with exposures that exhibit a strong PSF residual.

^ Note that for a single exposure image, equations (9) and (10) reduce to the
more familiar results of Bacon et al. (2003) with $\Delta\xi_{\rm sys} = A_{\rm sys}^2 \langle e^{*} e^{*} \rangle$ and
$\Delta\xi_{\rm obs} = \langle e^{\rm obs} e^{*} \rangle^2 / \langle e^{*} e^{*} \rangle$.
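
The two quadratic forms of equations (9) and (10) can be compared directly in a few lines. This is a sketch with hypothetical function names, assuming NumPy; it uses a linear solve rather than an explicit matrix inverse, which is a numerical design choice and not something specified in the text.

```python
import numpy as np


def delta_xi_obs(xi_sg, c):
    """Equation (10): xi_sg^T C^{-1} xi_sg."""
    xi_sg = np.asarray(xi_sg, dtype=float)
    c = np.asarray(c, dtype=float)
    return float(xi_sg @ np.linalg.solve(c, xi_sg))


def delta_xi_sys(a_sys, c):
    """Equation (9): A_sys^T C A_sys."""
    a_sys = np.asarray(a_sys, dtype=float)
    c = np.asarray(c, dtype=float)
    return float(a_sys @ c @ a_sys)


if __name__ == "__main__":
    c = np.array([[2.0, 0.5], [0.5, 1.0]]) * 1e-3
    a_sys = np.array([0.02, -0.01])
    # With no noise terms, xi_sg = C A_sys and the two measures agree.
    print(delta_xi_obs(c @ a_sys, c), delta_xi_sys(a_sys, c))
```

When the noise terms of equation (6) are added to the star-galaxy correlation the two quantities no longer agree, which is the situation encountered for individual one square degree fields.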

For the individual analysis of a one square degree field, we
can no longer assume that $\Delta\xi_{\rm obs} = \Delta\xi_{\rm sys}$ as the three noise terms
in the right hand side of equation (6) can be significant simply from
a chance alignment of cosmological shear, random measurement
noise or intrinsic ellipticities with the PSF. Using one square
degree patches of the CFHTLenS 'clone' (see Section 3.1) we find
$\Delta\xi_{\rm obs} > \Delta\xi_{\rm sys}$ even when $A_{\rm sys} = 0$. To illustrate this point we
multiply each component in equation (6) by the inverse PSF
covariance to define $A_{\rm obs}$,

$$ A_{\rm obs} = C^{-1} \xi_{\rm sg} = A_{\rm sys} + A_{\rm noise} + A_\gamma , \tag{11} $$

such that $A_{\rm obs}$ would be equal to $A_{\rm sys}$, the scale of the true residual
PSF signal in each exposure, if the noise terms $A_{\rm noise}$ and $A_\gamma$
could be ignored, where

$$ A_{\rm noise} = C^{-1} \langle (e^{\rm int} + \eta) \, e^{*} \rangle , \tag{12} $$

$$ A_\gamma = C^{-1} \langle \gamma \, e^{*} \rangle . \tag{13} $$

For each CFHTLenS field we first calculate $C$ from the measured
PSF model in each exposure. We then calculate the distribution of
values we measure for $A_{\rm noise}$ and $A_\gamma$ for each field, keeping $C$
fixed, but varying $e^{\rm int} + \eta$ and $\gamma$ using all 184 independent
simulations from the 'clone'. Figure 2 compares the distribution of values
measured for each component of $A_{\rm noise}$ (dashed) and $A_\gamma$ (dotted)
for all simulated realizations of the fields, normalized to the total
number of exposures in the survey. This can be compared to the
total discrete number of exposures with $A_{\rm obs}$ as measured from the
complete CFHTLenS data set (circles). Note that we use a scalar
symbol here as we show the distribution of measurements over all
exposures in the survey rather than the vector which contains the
measurement across all exposures in a particular field. This figure
shows that the combined distribution of $A_{\rm noise}$ and $A_\gamma$ (solid) as
measured from the simulated data is generally consistent with the
observed distribution of $A_{\rm obs}$ over all CFHTLenS MegaCam
imaging. We do, however, observe some outliers from the expected
distribution and indications of an increased width of the observed
distribution from the simulated distribution. This comparison reveals
the presence of a systematic PSF residual signal in a small fraction
of our data.
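
The per-field calculation described above can be sketched as two small functions: one building the PSF covariance between exposures and one multiplying a zero-lag cross correlation by its inverse. This is an illustration under stated assumptions, not the CFHTLenS code: the function names and array layout are hypothetical, only the '+' component is used, and the same routine is applied to data ellipticities (giving $A_{\rm obs}$) or to mock ellipticities (giving $A_{\rm noise}$ or $A_\gamma$) with the PSF catalogue held fixed.

```python
import numpy as np


def psf_covariance(e_psf, w):
    """C_ij = <e*_i e*_j> between exposures, '+' component, at zero lag.

    e_psf : complex array (n_gal, n_exp); w : real array (n_gal,)
    """
    e_psf = np.asarray(e_psf, dtype=complex)
    w = np.asarray(w, dtype=float)
    # Re(a * conj(b)) = a1*b1 + a2*b2
    return np.real((w[:, None] * e_psf).T @ np.conj(e_psf)) / w.sum()


def a_vector(e_gal, e_psf, w):
    """C^{-1} <e_gal e*>: A_obs for data, A_noise or A_gamma for mocks."""
    e_gal = np.asarray(e_gal, dtype=complex)
    e_psf = np.asarray(e_psf, dtype=complex)
    w = np.asarray(w, dtype=float)
    xi = np.real((w * e_gal) @ np.conj(e_psf)) / w.sum()
    return np.linalg.solve(psf_covariance(e_psf, w), xi)
```

Calling `a_vector` once per mock realization, with orientation-randomized ellipticities or simulated shears in place of `e_gal`, builds up the noise distributions against which the measured $A_{\rm obs}$ can be compared.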

Figure 2. The distribution of the components of $A_{\rm obs}$ for individual
exposures in CFHTLenS. The open symbols show the distribution for the full
data set. As the number of exposures is discrete we change from a log to
linear scale below n(A) = 1. The data can be compared with the different
components of the star-galaxy cross correlation function as measured
from the simulated CFHTLenS 'clone' data. The dashed curve shows the
contribution to the star-galaxy cross correlation function from the
intrinsic ellipticity distribution $A_{\rm noise}$ and the dotted curve shows the
contribution from chance alignments with the galaxy shear field $A_\gamma$,
demonstrating that significant star-galaxy correlations can be measured from chance
alignments of the PSF with the galaxy shear and intrinsic ellipticity field in
one-square degree regions.

Before we further develop our method to identify problematic
data in Section 3.4 we should pause to note a general cause
of concern over our choice to perform this analysis on one square
degree fields. This illustration particularly highlights the width of
the $A_{\rm noise}$ distribution compared to the low levels of $A_{\rm sys}$ that can
be tolerated in a cosmological analysis. In order for a tomographic 
cosmic shear analysis of CFHTLenS not to be dominated by sys- 
tematic errors we can estimate a ballpark tolerance for the level 
of PSF related systematics in the data (Amara & Refregier 2008).
Assuming a conservative average PSF ellipticity of $e^{*} \sim 0.1$, we
require $A_{\rm sys} < 0.01$ for CFHTLenS. This is an order of magnitude
less than the width of the distribution of $A_{\rm noise}$ on a field by field
basis. This analysis will therefore only be able to identify the most
problematic data in our sample, but we show how crucial this step
is in Section 5. Furthermore we note that for the first weak lensing
surveys imaged with quarter square degree field of view cameras
and less, this type of field by field analysis would have been even
less sensitive to PSF errors. For future space-based surveys with a
stable PSF pattern between exposures and fixed data quality, data
could, however, be grouped by PSF pattern to allow for the
extension of this analysis over much larger areas. Such grouping would
reduce the width of the $A_{\rm noise}$ and $A_\gamma$ distributions, allowing for
the detection of even the lowest level of PSF residuals. It should
be noted however that as the width of the $A_{\rm noise}$ distribution
decreases, this test could become sensitive to the fiducial cosmology
used in the simulations to compute $A_\gamma$. In addition, in a low noise
regime, very aggressive cuts on $A_{\rm obs}$ could potentially risk the
rejection of high shear regions of the data. As these high shear
regions have a true value of $A_\gamma$ that lives in the wings of the
distribution estimated from the simulations, any cuts must avoid
misinterpreting high shear regions as data with a high $A_{\rm sys}$ instead.
As Figure 2 shows that the $A_{\rm noise}$ distribution is significantly wider
than the $A_\gamma$ distribution, we clarify that
we are not in this regime for our analysis of CFHTLenS.

Finally we remind the reader that using the cross-correlation
between the galaxy and the PSF model ellipticities does allow us
to test for systematic errors in the data, but does not allow us to 
isolate the cause of such errors if they are detected. The most likely 
cause would result from residual errors in modeling the PSF. In 
addition, we also expect noise biases to introduce some low-level 
of cross-correlation as described in Appendix C of Miller et al.
(2012). Using a simplified example of Gaussian galaxy and PSF
profiles, Miller et al. (2012) show that the mean of the resulting
ellipticity likelihood surface is biased away from the true ellipticity
value, usually leading to a negative multiplicative calibration bias,
discussed in Section 4.1 (see also Refregier et al. 2012; Kacprzak
et al. 2012). The inclusion of a PSF ellipticity in this Gaussian
analysis revealed that the mean of the ellipticity likelihood surface is
also biased towards the PSF ellipticity. The extent of this effect de- 
pends on the PSF size and ellipticity and the galaxy size, ellipticity 
and signal-to-noise ratio. It is argued that the bias is also likely 
to depend on the radial surface brightness profiles of the PSF and 
galaxy ( [Miller et al.||2012^ . At a low signal-to-noise ratio, a star- 
galaxy cross-correlation term may therefore develop, although its 
amplitude is hard to predict. We do not, in this paper, attempt to 
separate the effect of this bias from PSF modeling errors, but in- 
stead treat any measured net effect as being indicative of an overall 
bias in the data, regardless of its origin. 

3.4 Identification of observations with significant residual
PSF errors

In order to identify observations with significant residual PSF
errors we wish to calculate the probability that the measured
systematic contribution to the two-point cosmic shear correlation function
$\Delta\xi_{\rm obs}$ is consistent with $\Delta\xi_{\rm sys} = 0$. To do this we are required to
take into account noise contributions from the intrinsic ellipticity,
random measurement noise and shear, as illustrated in Figure 2.

We first define a vector $U$, whose components are uncorrelated,
such that $\Delta\xi_{\rm obs} = U^{T} U$ with

$$U = L^{-1/2}\, V\, \xi_{\rm sg} , \qquad (14)$$

where we have decomposed $C$ into independent eigenvalues and
eigenvectors such that $C = V^{T} L V$ (see the singular value decomposition
section of Press et al. 1986 for more details). There are two
noise contributions to $U$. The first comes from the sum of the
intrinsic ellipticity, measurement noise and shear contributions to
$\xi_{\rm sg}$ that we wish to capture in our analysis. This we estimate by
measuring the variance of $U$ over 184 'clone' realizations of $\xi_{\rm sg}$, where
$C$ is measured from the data. To do this we assume the null
hypothesis that $\Delta\xi_{\rm sys} = 0$, which is reasonable as, in the case where the
systematics are non-zero, this method would underestimate the
error and make the systematics appear more significant. The second
noise contribution arises from any measurement error in $C$, which,
in its inversion in equation 10, could lead to instabilities in the
calculation of $\Delta\xi_{\rm obs}$. For CFHTLenS the PSF correlations between
exposures, given by the off-diagonal components of $C$, are typically
very small, $\sim 10^{-5}$, and hence subject to measurement error.
Considering that the PSF ellipticities are typically of the order $10^{-2}$,
these off-diagonal terms reveal how uncorrelated and hence how
unstable the PSF pattern is between even consecutive exposures.
We estimate a measurement error on C using a bootstrap technique 
dividing the model PSF measured at each galaxy location, e*, into 
CCD chip-sized groups based on their location in the MegaCam 
field of view. We calculate the variance introduced to each com- 
ponent of U measured from the data when C is estimated from 
different CCD chips in the field of view selected at random (with
repetition). We find the PSF covariance measurement error term
to be sub-dominant for the majority of components of $U$. For the
components of $U$ with eigenvalues less than a few percent of the
maximum eigenvalue, however, we find that the measurement errors
on $C$ start to dominate, leading to unstable measures of $\Delta\xi_{\rm obs}$.
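
As a brief check of the decomposition used here, the following worked step shows why the components of $U$ sum in quadrature to the systematics observable. It assumes that equation 10 has the quadratic form $\Delta\xi_{\rm obs} = \xi_{\rm sg}^{T} C^{-1} \xi_{\rm sg}$ (as implied by the inversion of $C$ mentioned above) and that $V$ is orthogonal, as holds for the eigenvectors of a symmetric covariance matrix.

```latex
\begin{align}
C &= V^{T} L V
  \quad\Rightarrow\quad
  C^{-1} = V^{T} L^{-1} V
  \qquad (V V^{T} = I), \\
\Delta\xi_{\rm obs}
  &= \xi_{\rm sg}^{T}\, V^{T} L^{-1} V\, \xi_{\rm sg}
   = \left( L^{-1/2} V \xi_{\rm sg} \right)^{T}
     \left( L^{-1/2} V \xi_{\rm sg} \right)
   = U^{T} U
   = \sum_{i} U_i^{2} .
\end{align}
```

Each term $U_i^2$ carries a factor of the inverse eigenvalue $1/L_{ii}$, which is why modes with small, poorly measured eigenvalues can destabilise the sum.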

In order to ensure our analysis is not dominated by uncertainty
in our measurement of $C$, we choose to only consider the
$M$ eigenmodes of $C$ for which this uncertainty contributes less than
20% of the total noise $\sigma_i$ as measured on each component $U_i$. The
total noise $\sigma_i$ per component combines in quadrature the variance of
$U_i$ over the 'clone' simulations and the variance on $U_i$ between
different bootstrap PSF realizations. To be clear, it is this latter
component which needs to contribute less than 20% of the total noise
for the eigenmode to be considered in the analysis. The discarded modes
of the PSF covariance $C$ typically have eigenvalues less than $\sim 5\%$ of the
maximum eigenvalue. This step can therefore be considered as a
principal component decomposition of the PSF covariance matrix
$C$ to use $\sim 95\%$ of the information within the covariance matrix
to calculate $\Delta\xi_{\rm obs}$, removing the other $\sim 5\%$ of the information in
the PSF covariance matrix that is attributed to measurement noise.
The exact value of the eigenvalue cut is, however, motivated on a
field by field basis, based on the measured noise in $C$. As any
systematics are likely to be dependent on the main PSF correlation
between fields, the application of this 'de-noising' method to the
PSF covariance matrix is not expected to bias our results.
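
The mode selection described above can be sketched as follows. This is a minimal illustration rather than the survey pipeline: the function name `select_reliable_modes` and its arguments are hypothetical, and it assumes that the 20% criterion compares the bootstrap standard deviation with the total standard deviation $\sigma_i$ (the text does not state whether variances or standard deviations are compared).

```python
import math


def select_reliable_modes(var_clone, var_bootstrap, max_fraction=0.2):
    """Return (indices, sigmas) of eigenmodes with sub-dominant PSF-covariance noise.

    var_clone[i]     : variance of U_i over the 'clone' realisations
    var_bootstrap[i] : variance of U_i over bootstrap estimates of C
    max_fraction     : largest allowed ratio of bootstrap noise to total noise
    """
    kept, sigmas = [], []
    for i, (vc, vb) in enumerate(zip(var_clone, var_bootstrap)):
        sigma_total = math.sqrt(vc + vb)  # the two terms combined in quadrature
        # Assumption: the criterion is applied to standard deviations.
        if math.sqrt(vb) < max_fraction * sigma_total:
            kept.append(i)
            sigmas.append(sigma_total)
    return kept, sigmas
```

The number of returned indices plays the role of $M$, and the returned `sigmas` are the per-mode errors $\sigma_i$ that enter the significance test.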

Once the number $M$ of reliable eigenmodes of the PSF covariance
matrix has been established, we can measure a probability that
the field has $\Delta\xi_{\rm obs}$ that is consistent with $\Delta\xi_{\rm sys} = 0$ by calculating

$$\chi^2 = \sum_{i=1}^{M} \frac{U_i^2}{\sigma_i^2} , \qquad (15)$$

where $M$ is the number of eigenmodes remaining in the analysis
and the total error $\sigma_i$ on each component $U_i$ combines the two
errors described above in quadrature. From equation 15 we then
determine a probability that $\Delta\xi_{\rm obs}$ is consistent with zero
systematics; $p(U = 0) = p(\chi^2|\nu)$. This is calculated from an incomplete
gamma function (Press et al. 1986) with the number of degrees of
freedom $\nu$ given by the number of eigenmodes in the analysis, $M$.
For the standard $N = 7$ exposure field we typically find $M = 4$.
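
The probability calculation can be illustrated with a short sketch. It assumes that the statistic is $\chi^2 = \sum_i U_i^2/\sigma_i^2$ over the $M$ retained modes and that $p(\chi^2|\nu)$ is the probability of exceeding the measured $\chi^2$, i.e. the regularised upper incomplete gamma function $Q(\nu/2, \chi^2/2)$. The function names are hypothetical, and the recurrence used here is a simple stand-in for a library routine, valid for integer $\nu$.

```python
import math


def chi2_survival(chi2, dof):
    """Probability of exceeding chi2 for an integer number of degrees of freedom.

    Evaluates Q(dof/2, chi2/2) with the recurrence
    Q(a + 1, x) = Q(a, x) + x**a * exp(-x) / Gamma(a + 1),
    starting from Q(1, x) = exp(-x) or Q(1/2, x) = erfc(sqrt(x)).
    """
    if dof < 1 or chi2 < 0:
        raise ValueError("dof must be >= 1 and chi2 must be >= 0")
    x = 0.5 * chi2
    if dof % 2 == 0:
        a, q = 1.0, math.exp(-x)
    else:
        a, q = 0.5, math.erfc(math.sqrt(x))
    while a < 0.5 * dof - 1e-9:
        q += x ** a * math.exp(-x) / math.gamma(a + 1.0)
        a += 1.0
    return q


def field_probability(u, sigma):
    """p(U = 0) for one field from its retained components U_i and errors sigma_i."""
    chi2 = sum((ui / si) ** 2 for ui, si in zip(u, sigma))
    return chi2_survival(chi2, len(u))
```

A field with four retained modes, each one standard deviation from zero, gives $\chi^2 = 4$ with $\nu = 4$, for which the closed form is $p = 3e^{-2}$.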



4 RESULTS AND ANALYSIS 

In Section 3 we presented a general method to utilize the star-galaxy
cross correlation function, as measured in each exposure,
to isolate data where residual signals remain that are correlated with 
the PSF. In this section we discuss the necessary calibration correc- 
tions that we apply to data and the application of our star-galaxy 
cross correlation analysis to select a sample of fields that have a 
star-galaxy cross correlation signal that is consistent with noise. 
We then investigate which fields fail our systematics tests. 



4.1 Calibration Corrections 

Calibration corrections are a standard feature of weak lensing
analyses. They typically consist of a multiplicative component $m$,
calibrated through the analysis of simulated images, and an additive
component $c$, calibrated empirically from the data:

$$e^{\rm obs} = (1 + m)\left[\gamma + e^{\rm int}\right] + c , \qquad (16)$$

where $e^{\rm obs}$ is the observed shear estimator for the measurement
method. This shear estimator is typically an estimate of galaxy
ellipticity, which lensfit defines as in equation 2, but differing
parameterization between methods is common. With a perfectly
calibrated analysis pipeline, the resulting shear estimator is unbiased
with $m = c = 0$. This is a level of accuracy which can never be
confirmed, owing to noise in the simulated data, but it is a level
of accuracy which is aspired to and has not yet been achieved to
better than percent level precision. Many shear measurement methods
have been calibrated in the recent GREAT10 challenge, which
confirmed the findings of previous image analysis competitions: all
tested shape measurement methods suffer from noise bias, with $|m|$
increasing as the signal-to-noise of the galaxy decreases (Kitching et al.
2012a). Whilst $c$ is often found to be negligible in image simulations,
this is not always the case with data, as shown by the two
most recent cosmic shear analyses by Schrabback et al. (2010) and
Huff et al. (2011). Schrabback et al. (2010) present a cosmic shear
analysis of the Hubble Space Telescope (HST) COSMOS survey.
They use image simulations from Massey et al. (2007a) to calculate
$m = -0.078\,(S/2)^{-0.38}$, where $S$ is a scaled measure that is
proportional to the signal-to-noise of the galaxy whose shear
measurement is to be calibrated. $S = 2$ corresponds to a signal-to-noise
ratio of $\nu_{\rm SN} \sim 7$ (Erben et al. 2001). An additive calibration $c$ is
also calculated empirically from the data as a function of flux,
observation date, sky noise, position on the image and galaxy size
in order to account for the effects of charge transfer inefficiency
(CTI) in the Advanced Camera for Surveys on HST. This space-based
empirical correction can be as large as $c_1 \sim 0.04$ (Rhodes
et al. 2007) and affects both components of the ellipticity
(Schrabback et al. 2010). Huff et al. (2011) present a cosmic shear analysis
of the deep Stripe 82 data from SDSS. They simulate SDSS Stripe
82 depth images using a public software package called SHERA
(Mandelbaum et al. 2012) and calculate an average calibration
correction that combines both a noise bias term and a responsivity
correction. The responsivity correction is related to the width of the
intrinsic ellipticity distribution and is expected to result in a
correction in the region of 0.8 to 0.9 in terms of $m$, where the exact
value is dependent on the galaxy population. The SHERA simulation
calibrated combined noise bias and responsivity correction
corresponds to an average correction of $m = 0.776$ applied to each
galaxy. An average additive calibration to the $e_1$ component of the
ellipticity of amplitude $c_1 = -0.002$ is also applied to each galaxy
to account for a bias which is believed to be introduced by the
preferential direction of the elongated SDSS photometry masks.

Miller et al. (2012) detail the CFHTLenS image simulations
used to quantify the required calibration for lensfit, finding an
additive correction $c$ consistent with zero for the simulated data. The
multiplicative calibration term $m$ was, however, found to be
significant and dependent on both galaxy signal-to-noise $\nu_{\rm SN}$ and size $r$
with

$$m(\nu_{\rm SN}, r) = \frac{\beta}{\log(\nu_{\rm SN})}\, \exp(-r\, \nu_{\rm SN}\, \alpha) , \qquad (17)$$

and a best-fit $\alpha = 0.057$ and $\beta = -0.37$. On a weighted average,
this corresponds to a 6% correction with $\langle 1 + m \rangle = 0.94$. As discussed
in Miller et al. (2012), an unbiased way to apply this calibration
correction is through a weighted ensemble average correction, rather
than dividing by $(1 + m)$ on a galaxy-by-galaxy basis. In the
calculation of our systematics test parameter $U$ (equation 14) we
therefore calculate the calibration correction $U^{\rm cal}$ given by



(18)
where the weighted average is taken over the same set of galaxies
used to calculate $U$. We remind the reader of the shorthand notation
used in this paper, as described in equation 4. The components of
the calibrated systematics test parameter we use in our analysis
(equation 15) are then given by $U_i / U_i^{\rm cal}$.
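
To make the multiplicative calibration concrete, the sketch below evaluates the bias model and applies it as an ensemble correction. It is a hedged illustration only: it assumes equation 17 has the form $m = \beta / \log_{10}(\nu_{\rm SN}) \times \exp(-\alpha\, r\, \nu_{\rm SN})$ with a base-10 logarithm and $r$ in arcseconds, and the function names are hypothetical rather than taken from the lensfit or CFHTLenS code.

```python
import math

ALPHA = 0.057  # best-fit value quoted in the text
BETA = -0.37   # best-fit value quoted in the text


def multiplicative_bias(snr, size, alpha=ALPHA, beta=BETA):
    """Multiplicative bias m for signal-to-noise `snr` and galaxy size `size`.

    Assumption: the logarithm in the model is base 10.
    """
    return beta / math.log10(snr) * math.exp(-size * snr * alpha)


def ensemble_calibrated_mean(e_obs, weights, m_values):
    """Weighted mean ellipticity divided by the weighted mean of (1 + m).

    This applies the correction to the ensemble rather than dividing each
    galaxy by its own noisy (1 + m).
    """
    numerator = sum(w * e for w, e in zip(weights, e_obs))
    denominator = sum(w * (1.0 + m) for w, m in zip(weights, m_values))
    return numerator / denominator
```

For two equally weighted galaxies with $m = -0.1$ and $m = -0.02$ that both carry a true signal of 0.05, the observed values are 0.045 and 0.049, and the ensemble correction recovers 0.05.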

CFHTLenS image simulations are also used to investigate
intrinsic ellipticity bias. Miller et al. (2012) find evidence that the
lensfit weights are very weakly biased in the sense that the galaxies
that are intrinsically oriented perpendicular to the PSF have slightly
narrower likelihood surfaces in comparison to those galaxies
oriented parallel to the PSF. This would be an unwanted side-effect
of any model fitting code that does not attempt to correct for this
effect, and indeed is something which is also seen in non-model
fitting shape measurement methods (Heymans et al. 2006a). If
significant for CFHTLenS, this would be identified with our star-galaxy
cross correlation analysis and the problematic data would be
removed accordingly.

We now turn our focus to calibrating any required CFHTLenS
additive calibration correction $c$ empirically from the data, even
though this was found to be consistent with zero in the image
simulation analysis. The comparison of equation 16 with equation 3
makes it tempting to equate $c$ with the systematic error term
discussed in Section 3, where $c = A_{\rm sys}^{T} e^{*}$. As we can remove
any data with significant PSF residuals, however, such that $A_{\rm sys}$
is consistent with zero, any strong additive calibration term $c$
remaining in the data that passes our systematics tests must derive
from an alternative source of error. For the data that passes our
systematics tests, as discussed further in Section 4.2, we measure
an average $\langle c_1 \rangle = \langle w e_1^{\rm obs} \rangle = 0.0001 \pm 0.0001$ that is
consistent with zero. For the second component of the ellipticity, however,
we find an additive calibration that is significantly non-zero with
$\langle c_2 \rangle = \langle w e_2^{\rm obs} \rangle = 0.002 \pm 0.0001$. Detailed investigation showed
this effect to be independent of PSF size, PSF ellipticity and galaxy
type. There is, however, a clear dependence on galaxy size $r$ and
signal-to-noise ratio $\nu_{\rm SN}$, where both parameters are determined
by lensfit. We find that it is the smallest, brightest objects that
contribute the strongest signal to the additive calibration. This can be seen
in the upper panel of Figure 3, which shows the weighted average
$\langle e_2^{\rm obs} \rangle$ as a function of galaxy size for four galaxy samples split by
signal-to-noise ratio $\nu_{\rm SN}$. The average is taken over all fields that
pass our systematics test, where the field selection is presented in
Section 4.2.

To quantify the additive component of the calibration
correction, we model $c_2$ as a function of size and signal-to-noise as

$$c_2 = \max\left[ \frac{F \log_{10}(\nu_{\rm SN}) - G}{1 + (r/r_0)^{H}} ,\; 0 \right] , \qquad (19)$$



and use a maximum likelihood method to fit the free parameters
$[F, G, H, r_0]$ to the data finely binned with 20 bins in $r$ and 8
bins in $\nu_{\rm SN}$. These were chosen such that there was sufficient
resolution in size to map the upturn for small galaxy sizes and
sufficient signal in the data to get a reliable fit. The parameters
are strongly degenerate with a best-fit reduced $\chi^2 \sim 1.4$ with
$[F = 11.910, G = 12.715, H = 2.458, r_0 = 0.01$ arcsec$]$. To illustrate
the calibration correction we show the best-fit model on the upper
panel of Figure 3 for the mean signal-to-noise ratio of each sample,
emphasizing that the data shown here are for four broad bins in $\nu_{\rm SN}$
and that the model was fit to finer bins. The lower panel shows the
effect of the $c_2$ calibration correction when applied to each galaxy,
reducing the average additive signal (shown as circles) to be
consistent with zero at all scales (shown as triangles). We do not calculate
errors on the free parameters in the model for the additive
calibration correction as, if uncalibrated, a systematic additive error would
enter into the cosmic shear signal only at the level of $10^{-6}$. Hence
the error on the additive calibration correction is negligible in our
error budget. For simplicity we apply this small additive $c_2$
correction on a galaxy-by-galaxy basis. This is in contrast to the ensemble
average multiplicative $m$ correction (see for example equation 18),
as unlike the division by $1 + m$ on a galaxy-by-galaxy basis, the
subtraction of $c_2$ is stable.

Figure 3. Additive Calibration Correction. The upper panel shows the
weighted average $\langle e_2^{\rm obs} \rangle$ as a function of galaxy size, $r$, for four galaxy
samples split by signal-to-noise ratio (circles, $\nu_{\rm SN} \sim 10$; crosses, $\nu_{\rm SN} \sim 15$;
open triangles, $\nu_{\rm SN} \sim 20$; and filled triangles, $\nu_{\rm SN} \sim 50$). It is
the small bright galaxies that contribute the most to the average raw
measure of $\langle e_2^{\rm obs} \rangle$ shown in the lower panel (circles). The $r$ and $\nu_{\rm SN}$ dependent
model in equation 19 is fit to the data in fine bins of size and signal-to-noise
ratio. The best-fit calibration correction for each $\nu_{\rm SN}$ in the upper panel is
shown by the dashed curves. The average $\langle e_2^{\rm obs} \rangle$ when measured from the
calibrated galaxy catalogues is shown to be consistent with zero (triangles,
lower panel). All errors come from a bootstrap analysis of the catalogue.
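
The additive correction can be sketched in a few lines. This is an illustrative reading rather than the released calibration code: it assumes the model has the form $c_2 = \max[(F \log_{10}\nu_{\rm SN} - G)/(1 + (r/r_0)^H),\, 0]$ with $r$ and $r_0$ in arcseconds, uses the best-fit values quoted in the text as defaults, and the function names are hypothetical.

```python
import math


def additive_c2(snr, size, F=11.910, G=12.715, H=2.458, r0=0.01):
    """Additive bias on the second ellipticity component for one galaxy.

    snr  : signal-to-noise ratio of the galaxy
    size : galaxy size in arcseconds (same units as r0)
    """
    amplitude = F * math.log10(snr) - G
    # The model is floored at zero, so faint galaxies receive no correction.
    return max(amplitude / (1.0 + (size / r0) ** H), 0.0)


def apply_c2(e2_obs, snr, size):
    """Subtract the modelled additive bias from an observed e_2 value."""
    return e2_obs - additive_c2(snr, size)
```

Under these assumptions the correction vanishes for a signal-to-noise ratio of 10 and grows towards small sizes at high signal-to-noise, which matches the qualitative trend described for the small, bright galaxies.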

It is unclear what causes this bias in the measured $e_2$
component for the small, bright galaxies. We can rule out the CTI that
impacted Schrabback et al. (2010), as CFHTLenS has a high sky
background and CTI affects predominantly the $e_1$ component. We
can also rule out the hypothesis of masking bias in Huff et al. (2011),
as the smaller masks in CFHTLenS are generally circular or
random in orientation, with the exception of the stellar diffraction spike
masks which, according to Huff et al. (2011), would affect the $e_1$
component, not $e_2$. One immediate potential source of error that
would affect $e_2$ more strongly than $e_1$ is the undersampling of the
PSF, as the size of the pixel in the $e_2$ direction is longer than in
the $e_1$ direction. This effect is, however, not seen in the analysis of
image simulations which include the correct MegaCam pixel scale
and typical CFHTLS PSFs, modeled using the best-fitting Moffat
(1969) light profile to each exposure in the survey. As the effect
is uncorrelated with PSF properties, we are therefore led to
conclude that it most likely arises from the data analysis and could
possibly be linked to an unknown effect in the data processing or
within the CFHT MegaCam camera. Possible examples could be a
non-isotropic charge diffusion within the CCD, non-uniform
intra-pixel response, residual errors in the cosmic ray masking which
have a non-isotropic behaviour, or low-level long-term persistence
or ghosting effects, as the CFHTLS dither strategy is in the
positive ($e_1$, $e_2$) direction. We note that there is concern that a random
camera or data processing related effect could also impact the PSF
modeling, as the stellar objects are small and bright. The unknown
cause of this bias is therefore also potentially linked to the fraction
of fields that we reject using our star-galaxy correlation systematics
test.



4.2 Field Selection 

For each field we calculate the systematics test parameter $U$
(equation 14), applying the calibration corrections described in
Section 4.1. We then calculate the probability that $\Delta\xi_{\rm obs}$ is consistent
with zero systematics, $p(U = 0)$, as detailed in Section 3.4, and
set an acceptance threshold on this probability using a method that
we demonstrate in Figure 4. Here, in the upper panel, we show the
measured systematic error observable $\Sigma(\Delta\xi_{\rm obs})$, where the sum is
taken over all fields (hatched area includes the $1\sigma$ bootstrap error
on the measure). This can be compared with the distribution of
values obtained from all the different realizations of the CFHTLenS
'clone' (solid). The 'clone' distribution shows the probability of
measuring $\Sigma(\Delta\xi_{\rm obs})$ from the full survey area if there were no
PSF residuals in the data. Note that by definition $\Delta\xi_{\rm obs}$ at zero-lag
is a positive quantity (see equation 10), so even for the simulated
'clone' catalogues, which have zero systematics by definition,
$\Sigma(\Delta\xi_{\rm obs})$ is non-zero. For comparison we also show the
distribution of $\Sigma(\Delta\xi_{\rm obs})$ that would be measured simply from a random
correlation between the pure cosmic shear $\gamma$ and the range
of CFHTLenS PSFs (dashed). The significance of this signal
reiterates the points made in Section 3.3 of how important it is to
take into account both the random intrinsic ellipticity noise and
underlying cosmic shear in this type of systematics analysis.
The conclusion we can draw from the upper panel of Figure 4
is that when we consider the full data set, the sum of the
measured star-galaxy cross correlation is very significant compared to
the expectation from the simulated 'clone' catalogues. We
therefore set a criterion that selects only those fields above a tunable
threshold probability that $\Delta\xi_{\rm obs}$ is consistent with zero
systematics, $p(U = 0)$. By increasing the cut on $p(U = 0)$ the
measured systematic error observable $\Sigma(\Delta\xi_{\rm obs})$ decreases rapidly, as
using $p(U = 0)$ for our selection criterion preferentially rejects the
fields with the strongest systematic residual errors. As the number
of fields in the analysis decreases, the $\Sigma(\Delta\xi_{\rm obs})$ expected from the
'clone' also decreases. This is because it is summed only over the
number of fields remaining in the analysis and there are fewer
positive numbers to sum. We continue this rejection process until the
$1\sigma$ confidence region on our measured systematic error observable
$\Sigma(\Delta\xi_{\rm obs})$ is in agreement with the peak of the probability
distribution expected for this quantity from the same number of fields in
the 'clone' simulations (lower panel). It is interesting to note that
the variance of the simulated distributions also becomes consistent
with the $1\sigma$ error on the measured $\Sigma(\Delta\xi_{\rm obs})$ when the threshold
selection is optimised in this way. This process sets a threshold of
$p(U = 0) = 0.11$, below which we label the field as 'failed'. This
leaves us with 75% of CFHTLenS fields which pass the
systematics test. We investigate the impact of this cut for two-point cosmic
shear statistics in Section 5.

Figure 4. Comparison of the measured $\Sigma(\Delta\xi_{\rm obs})$ (hatched) where the sum
is taken over all fields (upper panel) or over the fields with a measured
probability of zero systematics $p(U = 0) > 0.11$ (lower panel). These
measures can be compared with the probability distribution of measuring
$\Sigma(\Delta\xi_{\rm obs})$ from the same number of fields realized in the systematics free
CFHTLenS 'clone' (solid). For the full data set (upper panel), we find that
the measured $\Sigma(\Delta\xi_{\rm obs})$ far exceeds what is expected from the simulations.
Once a conservative cut is applied to the data (lower panel), removing 25%
of the data, we find the measured $\Sigma(\Delta\xi_{\rm obs})$ is fully consistent with the
expected distribution for the same number of simulated fields. For
comparison we also show the probability distribution of $\Sigma(\Delta\xi_{\rm obs})$ as measured
from a random correlation between the pure cosmic shear $\gamma$ and the range
of CFHTLenS PSFs (dashed).
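
The threshold search described in this section can be sketched as a simple loop over candidate cuts. This is a schematic under stated assumptions, not the analysis code: the function and argument names are hypothetical, the peak of the 'clone' distribution is approximated by the median of the simulated sums, and the $1\sigma$ error on the measured sum is supplied by the caller (for example from a bootstrap).

```python
import statistics


def choose_threshold(p_values, dxi_obs, clone_dxi, sigma_sum, thresholds):
    """Return (threshold, kept_indices) for the first cut where data and clone agree.

    p_values[i]  : p(U = 0) for field i
    dxi_obs[i]   : measured systematics observable for field i
    clone_dxi[k] : per-field values of the observable in clone realisation k
    sigma_sum    : callable giving the 1-sigma error on the measured sum
                   for a list of kept field indices
    thresholds   : candidate cuts on p(U = 0), tried in increasing order
    """
    for t in sorted(thresholds):
        kept = [i for i, p in enumerate(p_values) if p > t]
        if not kept:
            break
        measured = sum(dxi_obs[i] for i in kept)
        # Sum the clone over the same fields so both sums have the same length.
        clone_sums = [sum(real[i] for i in kept) for real in clone_dxi]
        peak = statistics.median(clone_sums)  # assumption: median as proxy for the peak
        if abs(measured - peak) <= sigma_sum(kept):
            return t, kept
    return None
```

Summing the simulated values over exactly the fields that survive each cut mirrors the point made in the text that the expected sum falls as fields are removed.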

For a complete and detailed account of the analysis we should
clarify at this point that the field selection and empirical $c_2$ additive
calibration correction described here and in Section 4.1 are actually
calculated using a two-step iteration. We first select fields
applying only the multiplicative $m$ calibration correction (equation 18)
as calculated from our simulated image analysis in Miller et al.
(2012). This first-pass field selection safeguards that the
empirical $c_2$ calibration correction we calculate from the selected data
is unrelated to the PSF. The additive correction that is empirically
calculated from these selected fields is then applied to the full
survey. We then re-run our systematics analysis on the full survey
to re-select fields which pass the systematics tests when both the
multiplicative and first-pass additive calibration corrections are
included. This safeguards that in the first-pass iteration, the additive
error term, now corrected by the $c_2$ calibration, did not mask the
presence of PSF residuals, or appear as a PSF residual in exposures
where the PSF is predominantly in the $e_2$ direction. At this
second-pass iteration we lose two fields and gain seven fields into our
selected clean data sample. Finally we empirically re-calculate the
additive calibration correction $c_2$ for this final set of selected fields
to improve the accuracy of the correction on the final field sample.
This re-calculation introduces a small percent level adjustment to
the first-pass measure and is the $c_2$ calibration that is presented in
equation 19.

Finally we discuss duplicate fields, originally imaged with an
i'.MP9701 filter, and re-imaged, after this initial filter was damaged
in October 2007, with the replacement i'.MP9702 filter. In general,
we do not distinguish between these two periods of $i'$ imaging,
although the different filter response curves are of course accounted
for in our photometric redshift analysis (Hildebrandt et al. 2012).
For the purposes of this discussion, however, we will refer to these
two filters as $i'_1$ and $i'_2$. Duplicate fields were re-imaged in order to
calibrate and assess the impact of the change of filter mid-survey, in
addition to some cases where preliminary concerns about the PSF
in the original observations led to their re-observation. In total 18
fields were imaged to close to full depth in both the $i'_1$ and $i'_2$ band
in the optimal seeing conditions required for the lensing analysis.
In principle, we may expect the second pass $i'_2$ observation to be
the best data, as the re-imaging for some of these fields was
undertaken to improve the PSF. It is therefore interesting to compare
the systematics results for these 18 fields, and clarify which of the
fields we choose to pass in the cases where both the $i'_1$ and $i'_2$ band
images pass our systematics analysis. Out of the duplicated fields,
2 failed the systematics test in both filters, 5 passed only in $i'_1$ and 1
passed only in $i'_2$. Out of the 10 fields which passed in both $i'_1$ and
$i'_2$, we choose to select data for our scientific analysis from the filter
with the highest probability $p(U = 0)$, with 4 fields selected in $i'_1$
and the other 6 in $i'_2$. With such a small number of fields we cannot
draw any significant conclusions on the dependence of systematics
on the filter, or whether the preliminary concerns about the PSF in
some of the duplicate observations were warranted.
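
The choice between duplicate observations reduces to a small decision rule, sketched below. The function name and the labels `"i1"` and `"i2"` are hypothetical, the default threshold is the $p(U = 0) = 0.11$ cut quoted in this section, and the handling of an exact tie is an assumption that the text does not address.

```python
def choose_filter(p_i1, p_i2, threshold=0.11):
    """Pick which i'-band observation of a duplicate field to use.

    p_i1, p_i2 : p(U = 0) measured in the original and replacement filter
    Returns "i1", "i2", or None if the field fails in both filters.
    """
    pass_i1 = p_i1 > threshold
    pass_i2 = p_i2 > threshold
    if pass_i1 and pass_i2:
        # Both pass: keep the observation with the higher probability.
        # Assumption: an exact tie falls back to the original filter.
        return "i1" if p_i1 >= p_i2 else "i2"
    if pass_i1:
        return "i1"
    if pass_i2:
        return "i2"
    return None
```

The counts quoted above are consistent with this bookkeeping: 2 fields failing in both filters, 5 and 1 passing in one filter only, and 10 passing in both add up to the 18 duplicate fields.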

4.3 Comparison of fields which pass and fail systematics tests 

The star-galaxy cross correlation field selection presented in
Section 4.2 identifies 25% of fields which have a significant PSF
residual signal. Whilst it is disappointing to reject this fraction of data
from our cosmological analysis of CFHTLenS, it is a major step
forward to be able to robustly identify the data that adds
systematic error into our analysis. Crucially we can do this in an analysis
that is insensitive to assumptions about the cosmology of the
Universe that we wish to determine. Factoring in the 19% area that is
lost to masked defects such as stellar diffraction spikes (see Section
5.3), our final total rejected area is 39% of the survey. We can
compare the amount of rejected CFHTLenS data to the fraction of data
rejected in previous CFHTLS analyses, which totaled 30% in
Hoekstra et al. (2006) and 40% in Fu et al. (2008), which arose in both
cases from a combination of masked defects and the conservative
chip boundary masks that we do not apply in our analysis.
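
The quoted total can be checked with one line of arithmetic, assuming that the 19% masked fraction applies equally to the fields that pass and fail, so that the two losses combine multiplicatively:

```latex
\begin{equation}
f_{\rm rejected} = 1 - (1 - 0.25)\,(1 - 0.19)
                 = 1 - 0.75 \times 0.81
                 = 1 - 0.6075
                 \approx 0.39 .
\end{equation}
```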
Figure 5. Comparison of fields which pass and fail the PSF systematics test
to different observables. The open symbols indicate fields that pass. The 
crosses and filled circles indicate fields that fail. These two sets are split 
into data observed before the CFHT 'lens flip' (open and filled circles) and 
the majority of the data observed after the CFHT 'lens flip' (open triangles 
and crosses). Each panel shows a different combination of observables; up- 
per left the average PSF ellipticity in both components; upper right, the 
average variation of the PSF ellipticity across the field of view; lower left 
a comparison of the average variation across the field of view to the PSF 
variation between the dithered image exposures of the field; lower right a 
comparison of seeing and airmass. See text for the other combinations of 
parameters linked to the observations that were also found to show no clear 
trend between the accepted and rejected fields. 



What can we learn from this set of fields where we are
unable to adequately correct for the PSF distortion? Figure 5 shows
just four examples of the many plots we inspected to compare how
different observables related to the type of fields which passed or
failed our systematics analysis. Each point represents a different
field, with the symbol indicating whether the field passed or failed
the systematics analysis; open symbols indicate a field that has
passed and crosses or filled circles indicate a field that has failed.
Before describing the figure in any detail, the key point to
note immediately is that there is no combination of observables which
separates the fields that fail from the fields that pass; the locus of
the fields that pass is closely followed by the locus of the fields that
fail. This statement is true for all the different combinations of an
extensive set of observables that were tested, as listed at the end
of this section. Focusing now on Figure 5 in detail, the upper left
shows the average PSF ellipticity in both components; upper right,
the average variation of the PSF ellipticity across the field of view;
lower left, a comparison of the average variation across the field of
view to the PSF variation between the dithered image exposures of
the field; lower right, a comparison of seeing and airmass.

At the start of the CFHTLS observations the MegaCam PSF
did not meet the design specifications, but an unexpected solution
was found by CFHT such that the image PSF became better
behaved. This solution involved inverting or 'flipping' the third lens
of the wide field corrector that is installed in front of the camera.
We can see the impact of this measure in the top right hand panel
of Figure 5, where we distinguish fields that were imaged before
the CFHT 'lens flip' (open and filled circles) and the majority of
the data observed after the CFHT 'lens flip' (open triangles and
crosses). This panel shows the variation of the PSF ellipticity across
the field of view for both components for each field. In this plane
we can easily separate the data taken before the CFHT 'lens flip'
owing to its strong variation across the field of view in both
ellipticity components. What is interesting, however, is that we cannot
separate the fields that fail and pass: the same fraction of fields fail
before and after the CFHT 'lens flip'. This indicates that the spatial
variation of our lensfit PSF model is sufficiently flexible to fit even
the strongest PSF variation across the field of view.

Our ability to correct PSF distortion is expected to depend on
the amplitude of the ellipticity and size of the PSF. The upper left
panel of Figure 5 shows the distribution of fields as a function of
the average PSF ellipticity in both components. The majority of
the data has PSF distortions within a few percent, with a slightly
stronger $e_2$ component. The lower right panel of Figure 5 shows
the distribution of fields as a function of seeing (PSF size) and
airmass. All the $i'$ data have sub-arcsecond seeing with the CFHTLS
seeing criterion of better than 0.8 arcsec met for the majority of the 
data. There are a handful of poorer seeing fields, however, where 
weather conditions deteriorated during the exposure, for example, 
in addition to early data taken before the CFHT 'lens flip' whilst 
the survey queue scheduling system was optimised. The survey was 
observed at low airmass as CFHT does not have an atmospheric
dispersion corrector. As before, we see very little to distinguish fields
that pass and fail in this space. It is clear that lensfit is able to 
produce robust results for even the strongest PSF distortions with 
$|e^{*}| \sim 0.1$ and the poorest of seeing, yet even with good seeing
conditions and percent level PSF distortion our analysis can fail. 
For the small sample of fields with seeing better than 0.6 arcsec all 
fields pass. This is a small sample so we are unable to make a con- 
clusive statement on this, but it is gratifying to see that nothing in 
the analysis appears to compromise our ability to analyse the best 
quality data. 

We now turn to the lower left panel of Figure 5, which
compares the average variation of the PSF ellipticity across the field
of view to the average variation of the PSF ellipticity between the 
exposures. Again we find nothing to indicate why fields pass and 
fail, but this plot illustrates the importance of analysing data from 
individual exposures compared to a stack, as it shows the variance 
of the PSF between the exposures is as significant as the variation 
of the PSF across the field of view. This means that the PSF in a co- 
added image stack is a complex blend of varying elliptical profiles, 
breaking the typical assumptions about PSF profiles used in shape 
measurement methods. Lensfit is the first method to optimally take 
into account the PSF variation between the exposures in a simul- 
taneous analysis of the unstacked data, making it a significant ad- 
vance in shear measurement. 
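
The comparison made in this panel can be illustrated with a minimal sketch that separates the scatter of one PSF ellipticity component into a part across the field of view and a part between exposures. The input layout (one list of stellar ellipticity values per exposure, measured at the same positions in every exposure), the function name `psf_variation` and the toy numbers are assumptions made for this illustration only; they are not taken from the CFHTLenS pipeline.

```python
from statistics import mean, pvariance


def psf_variation(exposures):
    """Return (across_field, between_exposures) variances.

    exposures: list of exposures, each a list of PSF ellipticity values
    (one component) sampled at the same positions in every exposure.
    """
    # Variance over positions within each exposure, averaged over exposures.
    across_field = mean([pvariance(exposure) for exposure in exposures])
    # Variance over exposures at each fixed position, averaged over positions.
    per_position = [list(values) for values in zip(*exposures)]
    between_exposures = mean([pvariance(values) for values in per_position])
    return across_field, between_exposures


# Toy data: three exposures, three positions (hypothetical values).
example = [
    [0.01, 0.03, 0.05],
    [0.03, 0.05, 0.07],
    [0.05, 0.07, 0.09],
]
across, between = psf_variation(example)
```

In this toy case the two variances are equal, which mimics the situation described above where the exposure-to-exposure variation is as significant as the variation across the field of view.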

Figure 5 shows just four example plots to demonstrate the
properties of the CFHTLenS data set. We have also investigated po- 
tential correlations of the fields that pass and fail with wind speed, 
wind direction relative to the telescope dome slit, dome and CCD 
temperature, the number of stars, the area of the field lost to cos- 
mic rays and defects, the fraction of area masked due to satellites or 
bright stars, the variance of the PSF size across the field of view and 
between exposures, the number of dithered exposures, higher order 
moments of the PSF and the Strehl ratio of the PSF, as estimated 
from the fraction of light in the central pixel. We find no clear
indication of a linear combination of observing conditions that causes
some fields to fail our analysis. This conclusion led us to look at
random effects such as higher order correlations in the PSF distortion
caused by atmospheric turbulence that we would be unable to
model owing to the limited stellar number density. This study is
detailed in Heymans et al. (2012), which concludes that for the
600 second CFHTLenS exposure times, the high order turbulent
correlations in the PSF are negligible.

Figure 6. Comparison of the two-point shear correlation statistic ξ+ (upper)
and ξ− (lower) measured using all the data (dashed) and only the 75% of the
fields which pass our tests. The impact of the application of the calibration
correction detailed in Miller et al. (2012) is shown to be within the statistical
errors.



5 SYSTEMATIC ERROR ANALYSIS OF TWO-POINT 
COSMIC SHEAR STATISTICS 

In the previous section we identified a small fraction of data which 
failed our criteria for cosmic shear analysis and we now assess 
the impact of this field selection on the two-point shear correlation
statistic ξ± (equations 4 and 5), which we use to place tight
constraints on cosmological parameters in Kilbinger et al. (2012).
Figure 6 compares the two correlation functions as measured from
the full survey area (dashed) and as measured from the survey area
which passes our systematics tests (solid). We can see that the field
selection has a significant impact on the amplitude of the signal at
large scales in ξ+ and that the calibration corrections applied to the
data, described in Section 4.1 (crosses in the full survey area case
and circles in case of the fields that pass), make no significant
difference to the measured signal. Note that the errors come from a
bootstrap analysis. 

It is crucially important at this point of the analysis not to
introduce confirmation bias into our cosmology analysis. This would
occur, for instance, if we were to assess the performance of our
systematics tests by comparing our results to those we expect from
parameter constraints from other surveys. The field selection and the
threshold optimization to set p(U = 0) > 0.11, described in
Section 4.2, were therefore made independently of, and blind to, any
knowledge of their impact on the cosmological parameter constraints
being investigated independently within the team. To confirm the
robustness of the systematics selection we developed an independent
test, making progressively more conservative cuts to the data
by increasing the threshold set on p(U = 0), the probability that
the fields had Δξ_obs consistent with zero (see Section 4). This is
demonstrated in Figure 7, which shows, in the upper panel, the
two-point shear correlation statistic ξ+ measured at 10 arcmin using
all data with a measured p(U = 0) above the varying threshold
shown on the horizontal axis. The percentage of fields that remain
at each threshold is shown in the lower panel. This figure shows a
progressive decrease in the amplitude of the large scale ξ+ signal
as fields with systematic errors are progressively removed from the
analysis with the increasing cut on p(U = 0). This decrease in
amplitude continues until the optimal threshold of p(U = 0) > 0.11,
set using a completely different and independent method in
Section 4.2, is reached (shown dashed). Removing more fields beyond
that threshold gradually increases the noise on the measurement, as the
survey area decreases, but does not decrease the amplitude of the
large scale power any further. This test gives us confidence that our
selection criterion, which we define independently in comparison to
simulations, is optimal and unbiased within the statistical noise of
the survey.

Figure 7. Upper panel: comparison of the two-point shear correlation
statistic ξ+ measured at 10 arcmin for varying systematics thresholds on
p(U = 0). The percentage of fields that remain at each threshold is shown
in the lower panel. Note that the errors are correlated as they show the
field-to-field variance between the fields used at each data point. In
addition they do not include the increasing sampling variance error as the
number of fields decreases. The dashed lines indicate the optimal threshold
that is defined independently, in Section 4.2.

Figure 8. Comparison of the variance between fields of the two-point shear
correlation statistic ξ+ (upper) and ξ− (lower). Using all the data reveals
a significant increase in the variance between fields on large scales. Using
only the 75% of the fields which pass our tests we find a field-to-field
variance that is consistent with the sampling variance expected from a WMAP5
cosmology.
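
The threshold test shown in Figure 7 can be sketched in a few lines. The example below is a minimal illustration under stated assumptions: each field is represented by a hypothetical pair (p(U = 0), ξ+ at 10 arcmin), the survey signal is taken as an unweighted mean over the retained fields (a real measurement would weight by the number of galaxy pairs), and the function name `threshold_scan` and all toy values are invented for this sketch.

```python
from statistics import mean


def threshold_scan(fields, thresholds):
    """Scan cuts on p(U = 0).

    fields: list of (p_u0, xi_plus) pairs, one per field.
    Returns a list of (threshold, mean_xi_plus, percent_of_fields_kept).
    """
    results = []
    for threshold in thresholds:
        kept = [xi for p_u0, xi in fields if p_u0 > threshold]
        if not kept:
            break  # no fields survive this cut
        percent = 100.0 * len(kept) / len(fields)
        results.append((threshold, mean(kept), percent))
    return results


# Toy survey: the two fields with low p(U = 0) carry an excess signal.
toy_fields = [
    (0.02, 3.0e-5), (0.05, 2.5e-5), (0.20, 1.0e-5),
    (0.50, 1.1e-5), (0.80, 0.9e-5), (0.95, 1.0e-5),
]
scan = threshold_scan(toy_fields, [0.0, 0.11, 0.3])
for threshold, xi_plus, percent in scan:
    print(f"p(U=0) > {threshold:.2f}: xi+ = {xi_plus:.2e}, {percent:.0f}% of fields")
```

In the toy data the amplitude drops once the contaminated fields are removed and then stays flat for stricter cuts, which is the qualitative behaviour the test looks for.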



5.1 Field-to-field variation of large-scale cosmic shear signal

One of the first indications of systematic errors in the catalogues
used in Fu et al. (2008) came from an analysis of the field-to-field
variation on large scales (Kilbinger et al. 2009). This is a
cosmology-dependent test which we cannot, and do not, use for
field selection. As we do not use the covariance of the two-point
correlation function to estimate cosmological parameters in any
future analyses, however, we are afforded the opportunity, at this
final stage, to compare the measured field-to-field variance to that
expected for a WMAP5 cosmology. In Figure 8 we therefore repeat
the Kilbinger et al. (2009) field-to-field variation analysis on
the CFHTLenS data, showing the variance of the two-point shear
statistics ξ± measured between the fields as a function of angular
scale, where the errors shown come from a bootstrap analysis.
The variance between the fields for the full survey (dashed) can be
compared to the variance between the fields that pass our systematics
tests, showing that the field selection brings down the variation
between the fields significantly. In addition, the field-to-field
variance can be compared to the variance expected from a WMAP5
cosmology as predicted by an analysis of the CFHTLenS 'clone'
(see Section 3.1), which is shown (dotted) to agree very well with
the measurements.
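
As a hedged illustration of the measurement plotted in Figure 8, the sketch below computes the variance between fields of a single ξ± value and a bootstrap error on that variance by resampling fields with replacement. The function name, the number of bootstrap resamples, the fixed random seed and the toy input values are all assumptions made for this example; the source does not specify how the bootstrap was implemented.

```python
import random
from statistics import stdev, variance


def field_variance_with_bootstrap(xi_per_field, n_boot=500, seed=1):
    """Return (variance between fields, bootstrap error on that variance)."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    n_fields = len(xi_per_field)
    measured = variance(xi_per_field)
    resampled = []
    for _ in range(n_boot):
        # Draw n_fields fields with replacement and recompute the variance.
        sample = [xi_per_field[rng.randrange(n_fields)] for _ in range(n_fields)]
        resampled.append(variance(sample))
    return measured, stdev(resampled)


# Hypothetical per-field values of xi+ at one angular scale (units of 1e-5).
toy_xi = [1.0, 1.2, 0.8, 1.1, 0.9, 1.05, 0.95, 1.3]
field_var, field_var_err = field_variance_with_bootstrap(toy_xi)
```

Repeating this for every angular bin, once for all fields and once for the fields that pass, gives curves of the kind compared in Figure 8.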



5.2 Analysis of tools previously advocated to detect 
systematic errors 

In previous weak lensing analyses, two standard tools have been
used to infer a level of systematics in the data that is consistent
with zero. The first test is essentially Δξ_obs(θ) (equation 10) but
averaged over the full survey, and using an average PSF from the
stack. This is often written as

\xi^{\rm sys}_{\pm}(\theta) = \frac{\langle e^{\rm obs} e^{\star} \rangle_{\pm}^{2}}{\langle e^{\star} e^{\star} \rangle_{\pm}}    (20)

(Bacon et al. 2003), where the correlations are calculated with the
two-point estimator defined earlier in the paper. Figure 9 compares
this statistic as measured from the
full CFHTLenS survey (crosses) and the selected areas of the survey
that pass our systematics tests (circles). The results show both
measures to be consistent with zero and well below the amplitude
of the lensing signal we would expect to measure for a WMAP5
cosmology (shown dashed). Note that we choose to use a log-scale
to demonstrate how small this measured systematic signal is, at the
expense of losing negative data points which are all consistent with
zero signal. With this test alone we might incorrectly conclude our
full survey was systematics free. In this full survey case, we suspect
that the errors are averaging out when this statistic is measured
across the survey. This therefore makes it crucially important to
undertake systematic error testing on a field by field and exposure by
exposure basis, as we have with our measurements of Δξ_obs
(equation 10).

Figure 9. Comparison of the ξ±^sys statistic often used to test systematics, as
measured on the full data set (crosses) and the fields which passed our
systematics tests (circles). The measured systematic signal can be compared to
the lensing signal we would expect to measure for a WMAP5 cosmology
(shown dashed). Even when using the full sample we find a
signal for ξ±^sys that is consistent with zero, concluding that the ξ±^sys
statistic does not provide a very sensitive test of the levels of systematics in the
CFHTLenS survey. The average CFHT MegaCam PSF correlation function
⟨e*e*⟩ is also shown for comparison (stars).
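
The averaging-out effect described above can be demonstrated with a toy calculation. The sketch assumes the survey-averaged statistic has the Bacon et al. (2003) form, the square of the star-galaxy cross-correlation divided by the star-star correlation; the per-field numbers are hypothetical and chosen so that PSF residuals have opposite signs in different fields.

```python
from statistics import mean


def xi_sys(xi_star_galaxy, xi_star_star):
    """Systematics statistic at one angular scale: <e_obs e*>^2 / <e* e*>."""
    return xi_star_galaxy ** 2 / xi_star_star


# Hypothetical per-field correlations at a single angular scale.
per_field_sg = [4e-5, -4e-5, 3e-5, -3e-5]  # star-galaxy cross-correlation
per_field_ss = [1e-3, 1e-3, 1e-3, 1e-3]    # star-star (PSF) correlation

# Survey-averaged test: residuals of opposite sign cancel before squaring.
survey_average = xi_sys(mean(per_field_sg), mean(per_field_ss))

# Field-by-field test: every contaminated field shows a non-zero signal.
field_by_field = [xi_sys(sg, ss) for sg, ss in zip(per_field_sg, per_field_ss)]
```

Here the survey-averaged statistic vanishes even though every individual field is contaminated, which is why a field by field test is the more sensitive diagnostic.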

Figure 9 also shows the CFHT MegaCam PSF correlation
function ⟨e*e*⟩ (stars), which has, on average, zero signal in the ξ−
correlation (lower panel) compared to a significant, essentially
constant signal in the ξ+ correlation (upper panel). In Figures 6 and 8
we see that the impact of our field selection is always in the ξ+
statistic. This therefore suggests that at least part of the systematic
signal in our failed fields is related to the PSF. This could
potentially arise from PSF modeling errors or as a result of noise bias in
the likelihood surfaces as discussed previously in Section 3.3.



The second standard weak lensing systematics test is an E/B
mode decomposition of the signal. Cosmic shear produces an almost
pure E-mode only distortion pattern, with some small B-mode
signal expected at angular scales less than a few arcminutes,
resulting from source redshift clustering (Schneider et al. 2002). A
detection of a significant B-mode indicates the presence of a
non-lensing systematic signal in the data. As the CFHTLenS PSF has no
ξ− correlation on average, the B-mode caused by any PSF residuals
will equal the E-mode caused by the same PSF residuals. In this
particular case, therefore, an E/B mode decomposition is effective
at detecting a PSF-related systematic. One drawback, however, is
that to avoid bias in the E/B measurement from data, the most
typically used E/B mode decomposition method from Schneider et al.
(2002) formally requires an input cosmology, which we generally
wish to avoid using in our blind systematics tests (see for example
Kilbinger et al. 2006). Alternative E/B decomposition statistics
have been proposed to avoid this cosmology dependence (Schneider
& Kilbinger 2007; Schneider et al. 2010), and we will investigate
these statistics in future work. Figure 10 compares the B-mode
signal measured from the data using three different two-point
statistics: the top-hat shear statistic ⟨γ²⟩_B (upper panel), the mass
aperture statistic ⟨M⊥²⟩ (middle panel), and the two-point shear
correlation function ξ_B (lower panel). These were calculated by
integrating over the two-point shear correlation function ξ± measured
from the data in 2000 fine angular bins between 0.13 < θ < 70 arcmin,
using the decomposition filter functions in equations 26, 32 and
41 of Schneider et al. (2002). We calculate the required cosmology
dependent normalization by extending the measured ξ± using
a WMAP5 cosmology model over the small and large angular scales
that are inaccessible to the CFHTLenS data (see Heymans et al.
2005 for more detail on the implementation of this correction). The
B-modes measured from the fields that pass our systematics tests
(circles) are consistent with zero on all scales. This can be
compared with the B-modes measured from all fields (crosses), which
are found to be significant on some scales for some of the statistics
tested. In the case of the B-mode of the two-point shear correlation
function ξ_B (lower panel) we clearly see an excess large-scale
B-mode signal when all the data is analysed. This signal is of similar
amplitude to the change in the measured two-point shear correlation
function ξ+ shown in Figure 6 when the failed fields are
removed from the analysis. We note, however, that the B-modes for
all fields, whilst significant, are well below the amplitude of the
lensing signal that we would expect to measure for a WMAP5
cosmology (shown solid). Had only this B-mode test been performed,
one might therefore conclude that shear systematic errors were
tolerable. Yet in Figure 6 the difference made by the field selection of
Section 4.2 can be clearly seen.

5.3 The impact of using automated or manual masks for 
cosmic shear 

The importance of correctly masking data to remove image artifacts
has long been understood in all applications of astronomy,
but it is of particular importance for weak lensing analyses. Even a
small fraction of diffraction spikes or broken detections along long
satellite trails erroneously present in a galaxy catalogue can
introduce a strong systematic signal as they align. The THELI
reduction pipeline includes an automated masking routine to detect
such artifacts (see Erben et al. 2012 for more details). Using the
i'-band stacked data, we manually inspect and modify these candidate
masks for use in our final analysis. In the majority of cases any
modifications made are because the automated routine is being too
conservative, and the manual inspection recovers ~1% of the
valuable area masked unnecessarily. Manual inspection also detects
the occasional faint satellite trail and faint diffraction spike missed
by THELI and has the added benefit of ensuring that at least one
person has inspected each image for any unusual behaviour. The
value of this effort, however, has to be balanced against the number of
human hours invested in such an endeavor, which for CFHTLenS
consisted of roughly 20 minutes per field and hence a total of ~60
hours for the full survey. In addition, whilst every effort was made
to maintain quality control by comparing two independent sets
of masks for a sub-set of fields and running the majority of fields
through a second-pass quality assurance check, the nature of visual
inspection is, by definition, subjective and therefore inherently
dependent on the team members who performed the task.

Figure 10. The B-mode signal measured using three different two-point
statistics: the top-hat shear statistic ⟨γ²⟩_B (upper panel), the mass aperture
statistic ⟨M⊥²⟩ (middle panel), and the two-point shear correlation function
ξ_B (lower panel). The B-modes measured from the fields that pass our
systematics tests (circles) can be compared with those measured from all fields
(crosses) and the E-mode amplitude of the lensing signal that we would
expect to measure for a WMAP5 cosmology (shown solid).

Figure 11. The two-point shear correlation statistic ξ+ measured using the
sample of data that passes our systematics tests masked in three different
ways: galaxies outside the final manually improved masks (circles), galaxies
outside the automated initial candidate masks (crosses) and galaxies that
lie within a masked region (stars). The automated and mask data points
(crosses and stars) are offset in each θ direction for clarity.

We now quantify how important this manual masking step is in
Figure 11 by comparing the two-point shear correlation statistic ξ+
measured using the sample of data that passes our systematics tests,
masked in three different ways: galaxies outside the final manually
improved masks (circles, as used in all analyses so far), galaxies
outside the initial automated candidate masks (crosses) and galaxies
that lie within a masked region (stars). The results show that
for the statistical accuracy achieved by CFHTLenS, the signal using
the candidate mask and manual mask are consistent. We should
note that in contrast to previous lensing surveys, CFHTLenS applies
a model fitting method to the data. Any objects that appear
to have a complex morphology, or those that are very poorly fit by
a two-component galaxy model, are therefore given a zero weight
and thus rejected from the sample. As such, lensfit is providing
an alternative for the removal of any artifacts missed by the
automated masking routine. The strong signal from the objects within
the masked region does, however, show the importance of removing
regions of image artifacts and that using a model fitting analysis
alone is not sufficient. The consistency between the candidate and
manual masks is very promising for the use
of automated masking in future surveys. When survey statistical
power grows significantly compared to CFHTLenS, however, the
robustness of automated masking should be re-assessed through an
extensive series of image simulation analyses.



6 TESTING REDSHIFT SCALING WITH 
GALAXY-GALAXY LENSING 

In Kilbinger et al. (2009), a preliminary tomographic analysis of the
CFHTLS found a cosmic shear signal for galaxy redshifts z > 1
that was inconsistent with the signal expected from the best fitting
ΛCDM cosmology as measured from a purely 2D analysis of the
data in Fu et al. (2008). This was seen as an indication of systematic
biases suggesting either an error with the photometric redshifts
used at that time or a redshift dependent shape measurement bias.
In light of this earlier finding we require a final test of the
robustness of the CFHTLenS data to the variation in lensing signal with
redshift. In this section we therefore develop and implement a
redshift scaling test for the data. We are unable to use the cosmology
dependent redshift testing methodology of Kilbinger et al. (2009),
as we use the CFHTLenS tomographic signal to constrain cosmological
parameters in Benjamin et al. (2012). Our method therefore
instead considers the redshift evolution of the tangential ellipticity
e_t(θ) measured about a lens at angular separation θ from a source
with complex ellipticity e (equation 2). This is commonly known as
galaxy-galaxy lensing, whose redshift evolution is only very weakly
sensitive to our assumptions on the fiducial cosmology, in contrast
to the evolution of the cosmic shear signal which is very sensitive
to a change in cosmology.

The mean tangential shear around a sample of lens galaxies
can be approximately described by a singular isothermal sphere
(SIS) shear profile (Bartelmann et al. 2001),

\gamma_t(\theta) = \frac{2\pi}{\theta} \left( \frac{\sigma_v}{c} \right)^2 \left[ \frac{D_{ls}}{D_s} \right] ,    (21)

where σ_v is the characteristic velocity of the SIS, c is the speed of
light, and the ratio of angular diameter distances

\left[ \frac{D_{ls}}{D_s} \right] = \int_0^{z_{\rm max}} dz_l \, p_l(z_l) \int_{z_l}^{z_{\rm max}} dz_s \, p_s(z_s) \, \frac{f_K[w(z_s) - w(z_l)]}{f_K[w(z_s)]}    (22)

is a purely geometrical factor that depends on the effective redshift
probability distribution function p_l(z) of lenses and p_s(z) of
sources. f_K(w) denotes the co-moving angular diameter distance
for a co-moving radial distance w and curvature K of the fiducial
cosmological model (see for example Bartelmann & Schneider
2001). z_max is the maximum observed galaxy redshift beyond
which p_s(z) is zero. For a given lens subsample in a fixed redshift
bin i, the mean tangential shear of galaxies in redshift bin j
increases with mean source redshift of the bin. The relative change
of the shear signal (for a fixed lens redshift) is a very weak function
of the cosmological parameters, such that equation 22 robustly
predicts the relative change of γ_t^{ij}(θ) for a wide range of
cosmological parameters. The absolute amplitude of γ_t^{ij} for a fixed i is
given by the SIS velocity σ_v, which will vary for each lens sample.
We choose to use the simple SIS profile for the purposes of this test
as this has been shown to be a good model for the data for angular
scales θ < 3 arcmin (van Uitert et al. 2011).
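
To make the geometry concrete, the following minimal sketch evaluates the SIS tangential shear for a single lens redshift and a single source redshift, that is, replacing the redshift distributions in the distance-ratio integral by delta functions. It assumes a flat ΛCDM model, for which f_K(w) = w, and an illustrative matter density of 0.27 that is not taken from the source; the function names and the simple trapezoidal integration are likewise choices made for this example.

```python
import math


def comoving_distance(z, omega_m=0.27, n_steps=1000):
    """Comoving distance w(z) in units of c/H0 for flat LCDM (trapezoidal rule)."""
    omega_lambda = 1.0 - omega_m

    def inverse_e(zp):
        return 1.0 / math.sqrt(omega_m * (1.0 + zp) ** 3 + omega_lambda)

    dz = z / n_steps
    total = 0.5 * (inverse_e(0.0) + inverse_e(z))
    for k in range(1, n_steps):
        total += inverse_e(k * dz)
    return total * dz


def distance_ratio(z_lens, z_source, omega_m=0.27):
    """D_ls / D_s for delta-function redshift distributions in a flat model."""
    if z_source <= z_lens:
        return 0.0  # a source in front of the lens is not lensed
    w_lens = comoving_distance(z_lens, omega_m)
    w_source = comoving_distance(z_source, omega_m)
    return (w_source - w_lens) / w_source


def sis_tangential_shear(theta_arcmin, sigma_v_kms, z_lens, z_source, omega_m=0.27):
    """SIS shear: (2 pi / theta) (sigma_v / c)^2 D_ls / D_s, theta in radians."""
    c_kms = 299792.458
    theta_rad = math.radians(theta_arcmin / 60.0)
    ratio = distance_ratio(z_lens, z_source, omega_m)
    return 2.0 * math.pi / theta_rad * (sigma_v_kms / c_kms) ** 2 * ratio


# Relative scaling of the signal between two source redshifts, for two
# different flat cosmologies; the scaling changes only weakly.
scaling_low = distance_ratio(0.3, 1.0, 0.2) / distance_ratio(0.3, 0.5, 0.2)
scaling_high = distance_ratio(0.3, 1.0, 0.4) / distance_ratio(0.3, 0.5, 0.4)
```

Because H0 cancels in the ratio, only the shape of the expansion history enters, which is the reason the redshift scaling is so weakly dependent on the fiducial cosmology. Extending the sketch to the full integral would require weighting `distance_ratio` by the lens and source redshift distributions.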

We select a sample of lens galaxies with magnitude i'_AB <
22.5 and the full CFHTLenS catalogue for our source sample. All
galaxies have a photometric redshift estimate and associated error
distribution P(z) (Hildebrandt et al. 2012). We use these to
initially split both samples into N_z = 6 photometric redshift bins:
[0.0, 0.2], [0.2, 0.4], [0.4, 0.5], [0.5, 0.7], [0.7, 0.9], and [0.9, 1.3],
and then calculate the effective redshift distribution for each bin
from a weighted average of the probability distributions of the
galaxies within each bin, incorporating each source galaxy weight.
For each of the N_z² combinations of lens i and source j subsamples
we compute the mean tangential shear γ_t^{ij}(θ) by averaging source
ellipticities e_t around the lenses in bins with an angular scale
ranging from 0.8 arcmin to 4 arcmin. As we are interested in testing
redshift scaling we measure e_t for sources that are both in front of
and behind the lenses. We take a weighted average using the stellar
mass of the lens, estimated photometrically from the data (Velander
et al. 2012). This weighting scheme aims to optimize the
signal-to-noise of the measurement, as the stellar mass is expected to
correlate with halo mass. The measurement is made for each field which
passes our systematics tests in Section 4.2 and we make a jackknife
statistical error estimate of the mean signal over the survey (Shao
1986).
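
The two averaging steps described above, a weighted mean of tangential ellipticities and a jackknife error over fields, can be sketched as follows. This is a simplified illustration: it uses a delete-one jackknife with equal weight per field, which is an assumption, as the source does not state the exact jackknife configuration, and the function names and toy numbers are hypothetical.

```python
def weighted_mean(values, weights):
    """Weighted mean, e.g. tangential ellipticities weighted by lens stellar mass."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)


def jackknife_mean_error(per_field_signal):
    """Delete-one jackknife estimate of the survey mean and its standard error."""
    n_fields = len(per_field_signal)
    total = sum(per_field_signal)
    # Mean of the signal with each field left out in turn.
    leave_one_out = [(total - x) / (n_fields - 1) for x in per_field_signal]
    centre = sum(leave_one_out) / n_fields
    variance = (n_fields - 1) / n_fields * sum(
        (m - centre) ** 2 for m in leave_one_out
    )
    return total / n_fields, variance ** 0.5


# Toy example: two sources around lenses of different stellar mass weight.
mean_et = weighted_mean([0.01, 0.03], [1.0, 3.0])

# Toy per-field mean tangential shear values (arbitrary units).
survey_mean, survey_error = jackknife_mean_error([1.0, 2.0, 3.0, 4.0])
```

For a simple mean the delete-one jackknife error reduces to the usual standard error of the mean, which provides a quick check of an implementation.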

Figure 12 shows the amplitude of the best-fitting SIS model
for γ_t^{ij} at θ = 1 arcmin as a function of the mean photometric
redshift of the sources for the six photometric redshift bins. The
data corresponding to each of the lens redshift bins is indicated in
the legend, where for clarity we only plot the data from the
source-lens bin combinations where the lens redshifts are in the trusted
0.2 < z < 1.3 redshift range (see Section 2.3). The lines are trends
expected from a WMAP5 cosmological model, incorporating the
redshift probability distribution P(z) of lenses and sources as
determined from the data. The amplitudes of the expected curves are
adjusted by finding the value of σ_v that best fits the data for each
lens subsample using all sources. For higher mean lens redshifts,
the SIS velocity increases as expected for a flux-limited sample, and
we refer the reader to Velander et al. (2012) for a detailed
galaxy-galaxy lensing analysis to measure halo mass in CFHTLenS.

Figure 12. The mean tangential shear at an angular separation of one
arcmin as a function of the weighted mean source photometric redshift. The
data points are obtained from the best-fitting SIS model of the data
between 0.8 and 4 arcmin. Different symbols correspond to different redshift
lens subsamples (see legend). The lines correspond to a WMAP5 cosmology
theoretical prediction of the change in the shear signal with an overall
amplitude that is fixed by the best-fitting SIS velocity and allowing for the
full P(z) distribution of the source redshifts. This yields a non-zero
expectation for sources whose nominal photometric redshift places them in front
of the photometric redshift of the lenses.

The most striking feature of Figure 12 is the significant signal
detected from source galaxies with photometric redshifts z < 0.2.
If the photometric redshift measure was accurate we would not
expect to measure a galaxy-galaxy lensing signal for any of the lens
samples shown, as these sources would be in front of all the lenses.
It is most likely that this signal results from the strong catastrophic
errors measured for z < 0.1 in Hildebrandt et al. (2012), where
high redshift sources have been misclassified at low redshifts. The
rise in the expected WMAP5 cosmological model for this low
redshift source bin shows that some of this catastrophic error
information is contained in the photometric redshift probability
distribution P(z) which is used to compute the model. But the data for
this source redshift bin remains inconsistent with the models for the
majority of lens samples, supporting our choice not to trust the data
with photometric redshift estimates z < 0.2.

With the exception of the data with z < 0.2, we find that
we measure a redshift dependence of the galaxy-galaxy lensing
signal that is consistent with a cosmological model for redshifts
0.2 < z < 1.3. Figure 12 therefore demonstrates the robustness
of the CFHTLenS shear and redshift catalogues by confirming the
absence of any significant redshift dependent biases in our method.
The excellent agreement at the high redshifts that were contaminated
by systematic errors in an earlier CFHTLS analysis (see
Kilbinger et al. 2009 for details) is particularly encouraging. Note
that assuming an open cold dark matter model with Ω_Λ = 0
produces a theoretical curve that is indistinguishable from the ΛCDM
curve shown in Figure 12, clarifying that this test is insensitive to
the assumed fiducial cosmology. We therefore conclude that once
the calibration and selection procedures outlined in this paper are
incorporated, our weak lensing catalogues are of a very high quality
and ready for the application of many varied cosmological studies.



7 CONCLUSIONS 

This paper presents the Canada-France-Hawaii Telescope Lensing
Survey (CFHTLenS) and is one of four technical papers
detailing and verifying the accuracy of each component of the
CFHTLenS data analysis pipeline, which is overviewed in
Section 2. We dissect the pixel level analysis in Erben et al. (2012), the
photometry and redshift analysis in Hildebrandt et al. (2012), and
the shear and image simulation analysis in Miller et al. (2012). In
this paper we focus on bringing all these key stages of the pipeline
together to perform the final step of the analysis, identifying and
removing any problematic data which contaminates the survey. As
illustrated in Table 1, the complete pipeline has been re-written for
CFHTLenS as every core stage of the standard pipeline used in
previous analyses could have introduced low-level systematic errors.
We therefore hope that the set of CFHTLenS technical papers
will provide a detailed record of this work, acting as a reference
to aid the future analyses of upcoming weak lensing surveys. We
argue that for a science goal that is as demanding on the data
analysis as the study of weak gravitational lensing, every pipeline stage
must be integrated and optimised for shape and photometry
measurements. From the raw data to the shear and photometric redshift
catalogues there are many stages where systematic errors can be
introduced. As such we advocate that surveys take a holistic approach
to their lensing analyses as we have for CFHTLenS.

One of the most important developments during the
CFHTLenS project resulted from using the lensfit shape measurement
method's ability to optimally analyse individual exposures in
comparison to stacked data. This improved the quality of the PSF
model and hence the accuracy of the resulting shear measurement.
That said, this particular advance cannot be identified as the sole
solution to the systematic errors that concerned earlier analyses of
CFHTLS (see a discussion of these errors in Kilbinger et al. 2009).
Indeed we conclude that it was most likely the combination of many
small low-level effects, discussed in Section 2, that we have sought
to identify at source and improve in this analysis.

Whilst every effort has been made to develop a perfect end-to-end
analysis tool, and significant improvements have been made,
we still find ourselves in the unsatisfactory position of requiring
the removal of 25% of the observed fields from our analysis (see
Section 4.2) in addition to the application of small but significant
calibration corrections (see Miller et al. 2012 and Section 4.1).
This paper is primarily focused on the development (Section 3) and
application (Section 4) of a method to both calibrate and identify
problematic data. By incorporating full cosmic shear N-body
simulations we are able to accurately determine the noise levels in a
field by field, exposure level star-galaxy cross correlation analysis,
in order to identify fields with a significant PSF residual. We have
demonstrated how adept this new method is at identifying individual
fields where PSF residuals contaminate the signal. We have also
shown how the resulting measured two-point cosmic shear signal
is robust to the threshold criteria we set on this systematic test in
Section 3. For comparison purposes we present the results of two
standard systematic tests for the data, demonstrating how survey
average tests are less sensitive to systematic error signals which
become diluted when considered in a survey average, but are still
present at a local level within the data.

In Section 4.3 we presented a comprehensive search which
found no evidence for any correlations between the 25% of fields
which failed our systematics analysis and parameters which
characterized the CFHT dome, telescope, camera and atmospheric
conditions at the time of the observations. We are therefore unable to
conclude whether this level of rejected data should be expected for
all lensing surveys, all ground-based surveys, all lensfit analyses or
all MegaCam observations. The CFHTLenS pipeline is currently in
use to re-analyse the CFHT Red Sequence Cluster Survey (RCS2,
see for example van Uitert et al. 2011), and analyse the CFHT
i'-band imaging survey of the SDSS Stripe 82 region, the Next
Generation Virgo Survey (NGVS, Ferrarese et al. 2012), and the Kilo
Degree Survey (KiDS) on the VLT Survey Telescope. It will be
interesting to compare the relative performance of the pipeline
between these different surveys.

We demonstrate that our shear and photometric redshift
catalogues are 'science-ready', with a final cosmology insensitive
galaxy-galaxy lensing test in Section 6 to confirm that the survey
is not subject to the strong redshift-dependent biases found in
earlier analyses. These catalogues are being used for a variety of
lensing analyses including the first set of CFHTLenS cosmological
parameter constraints (Kilbinger et al. 2012; Benjamin et al.
2012), constraints on parameterised models of modifications
to gravity (Simpson et al. 2012) and constraints on the connection
between galaxy properties and their parent dark matter haloes
(Velander et al. 2012). With the community embarking on three major
new weak lensing surveys, this is a very exciting time for the study
of the 'Dark Universe' with weak gravitational lensing. We hope
that the suite of detailed CFHTLenS technical papers will serve as
a solid core upon which to build the hyper-precision weak lensing
analyses of the future.